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METHOD EVOLVED FOR RECOGNITION OF THROMBOPfflLIA (MERT) 



FIELD 

This application relates to methods of predicting an individual's genetic 
5 susceptibility to venous thrombosis, as well as kits that can be used to practice the 
disclosed methods. 



BACKGROUND 

Venous thrombosis affects 1 per 1000 individuals annually and is one of the 
1 0 leading causes of mortality and morbidity resulting in approximately 300,000 

hospitalizations and 50,000 fatalities per year in the United States alone (Rosendaal, 
Thromb. Haemost. 78:1-6, 1997; Nordstrom a/., J. Inter. Med. 232:155-60, 1992; 
and Hansson et al. , Arch. Intern. Med. 157:1 665-70, 1 997). 

Numerous conditions predispose an individual to venous thrombosis. Examples 
1 5 of such risk factors include pregnancy, puerperium, use of oral contraceptives and/or 
hormone replacement therapy, trauma, surgery, fractures, prolonged inmiobilization, 
advanced age, antiphospholipid antibodies, previous thrombosis history, 
myeloproliferative disorders, malignancy, and mild-to-moderate hyperhomocysteinemia 
(Abramson et al.. Southern Med. J. 94:1013-20, 2001; and Seligsohn and Lubetsky, N. 
20 Engl. J. Med 344:1222-31, 2001). Venous thrombosis often occurs in the lower leg as 
a deep venous thrombosis (DVT) which often leads to pulmonary emboli that are often 
fatal. If is particularly unfortunate that such thromboembolic phenomena often occur in 
already physiologically compromised patients. 

In addition to acquired risk factors for venous thrombosis, a nimiber of 
25 seemingly monogenic, autosomal dominant, variably penetrant genetic mutations or 
polymorphisms impart an increased risk for venous thrombosis. Examples of such 
mutations and/or polymporphisms are in genes that encode procoagulant proteins 
(Factor V, prothrombin B and fibrinogen), natural anticoagulant proteins (protein C, 
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protein S and antithrombin III) and other thrombosis related proteins (angiotensin-I 
converting en2yme and methylenetetrahydrofolate reductase). Therefore, venous 
thrombosis is a complex genetic disorder. Genetic defects leading to hyperactivity of 
the coagulation system are present in a large proportion of patients with venous 
thrombosis. More than 60% of the predisposition to thrombosis is attributable to 
genetic components (Souto etal.,Am. J. Hum. Genet. 67:1452-9, 2000). 

Previous reports describe screening for one or more polymorphisms associated 
with thrombosis, for example by using PGR (Harrington et al, Clin. Chem. Lab. Med. 
41 :496-500, 2003), or microplate array diagonal gel electrophoresis (Bauer et al., 
Thromb. Haemost. 84:396-400, 2000). Although the use of microarray technology to 
screen for mutations in particular genes involved in venous thrombosis has been 
proposed, these microarrays are limited because they have a low predictive value and 
only detect mutations that are prevalent in Caucasian populations (for example see 
Pecheniuk et al.. Blood Coagul. Fibrinolysis 1 1 :683-700, 2000; PoUak et al., Ital. 
Heart J. 2:568-72, 2001; Evans and Lee-Tataseo, Clin. Chem. 48:1406-1 1, 2002; 
Schrijver et al.. Am. J. Clin. Pathol. 1 19:490-6, 2003; Erali et al, Clin. Chem. 49:732- 
9, 2003). Others have suggested that microarray technology needs to undergo further 
development before it is available for screening the numerous genetic mutations and 
polymorphisms involved in thrombosis (Grody, ^/inw. Rev. Med., 54:473-90, 2003). 

Therefore, there is a need for a method that can accurately predict the risk of an 
individual for developing venous thrombosis, which can be used to screen multiple 
ethnic populations. 

SUMMARY 

Although venous thrombosis is one of the leading causes of morbidity and 
mortality in developed countries, it is an avoidable disease by the use of prophylacitc 
treatment with currently available anticoagulants such as unfractionated heparin, low 
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molecular-weight heparins, aspirin, and coumadin/warfarin. Thus, it is beneficial to 
estimate the individual thrombotic risk to develop stratification protocols for an 
individual risk-adapted prophylaxis to avoid the development of venous 
thromboembolism. For this stratification, the individual risk associated with single or 
combined risk factors of hemostasis is estimated. 

The inventors have identified combinations of mutations and polymorphisms in 
venous thrombosis-related molecules that allow one to predict the genetic susceptibility 
of an individual to developing venous thrombosis with high accuracy in several ethnic 
populations. For example, the disclosed statistical analysis regarding concurrent testing 
of at least ten venous thrombosis associated genetic variations using the disclosed 
method demonstrated that the prediction of venous thrombosis is as accurate as at leeist 
99% in Caucasians, at least 88% in Asians, and at least 91% in African populations. , 
The disclosed methods, herein termed methods evolved for recognition of 
thrombophilia (MERT), provide a rapid and cost-effective assay that allows for 
concurrent genetic testing in several venous thrombosis-associated susceptibility 
molecules, for example factor V, prothrombin (factor II), fibrinogen, protein C, protein 
S, antithrombin, angiotensin I-converting en2yme (ACE), and 

methylenetetrahydrofolate reductase (MTHFR). In one example, the method includes 
determining whether a subject has one or more mutations, polymorphisms, or both, in 
venous thrombosis-associated molecules that comprise, consist essentially of, or consist 
of, sequences fi-om factor V, factor II, fibrinogen, protein C, protein S, antithrombin, 
ACE, and MTHFR. In one example, asymptomatic individuals are screened before or 
during their exposure to high risk situations that provoke thrombosis, such as 
pregnancy, puerperium, use of oral contraceptives or hormone replacement therapy, 
previous thrombosis history, prolonged immobilization, myeloproliferative disorders, 
malignancy, surgery, bone fi-acture, advanced age, antiphospholipid antibodies, or 
combinations thereof 
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Although there are some already existing tests for screening up to six 
thrombophilia susceptibility single nucleotide polymorphisms, they have limited 
potential and a maximum predictive value of 1 .7%. Such tests have a screening 
capacity that only allows them to detect single nucleotide polymorphisms (SNPs) that 
5 are prevalent only in Caucasian populations (Erali et al, Clin. Chem. 49:5, 2003; Evans 
et al., Clin. Chem 48:1406-1 1, 2002). In contrast, the methods and arrays disclosed 
herein are the first offering a highly accurate venous thrombosis prediction by screening 
not only for SNPs but also for insertions and deletions in diverse populations. 

In a particular example, the method includes amplifying nucleic acid molecules 

1 0 obtained from a subject to obtain amplification products. The amplification products 
can comprise, consist essentially of, or consist of, sequences from factor V, prothrombin 
(factor II), fibrinogen, protein C, protein S, antithrombin III, angiotensin I-converting 
enzyme (ACE), and methylenetetrahydrofolate reductase (MTHFR) genes. The 
resulting amplification products are contacted with or applied to an array. The array 

1 5 includes oligonucleotide probes capable of hybridizing to factor V, factor II, fibrinogen, 
protem C, protein S, antithrombin III, ACE, and MTHFR sequences that include one or 
more mutations, polymorphisms, or both. Examples of particular mutations and 
polymorphisms are provided in Table 1 . In some examples, the array fiirther includes 
oligonucleotides capable of hybridizmg to wild-type factor V, wild-type factor II, wild- 

20 type fibrinogen, wild-type protein C, wild-type protein S, wild-type antithrombin III, 
wild-type ACE, and wild-type MTHFR. The amplification products are incubated with 
the array for a time sufficient to allow hybridization between the amplification products 
and oligonucleotide probes, thereby forming amplification products:oligonucleotide 
probe complexes. The amplification products:oligonucleotide probe complexes are then 

25 analyzed to determine if the amplification products include one or more mutations, 
polymorphisms, or both, in factor V, factor II, fibrinogen, protein C, protein S, 
antithrombin III, ACE, or MTHFR. The presence of one or more mutations indicates 
that the subject has a genetic predisposition for venous thrombosis. In particular 
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examples, the presence of more than one mutation or polymorphism indicates that the 
subject is at a greater risk for venous thrombosis than is a subject having only one 
mutation/polymorphism. 

The disclosed method can accurately assess the risk of developing venous 
5 thrombosis and thereby lead to avoiding venous thrombosis, for example by initiating 
appropriate prophylactic therapies in appropriate circumstances. The results presented 
herein demonstrate that concurrent use of a panel of genetic tests for at least eight 
molecules associated with venous thrombosis increases the positive predictive value 
more than 40 fold, when used for detecting venous thrombosis or a predisposition to its 

10 development. Therefore, methods of selecting venous thrombosis therapy are disclosed, 
which include detecting a mutation, polymorphism, or both (such as a substitution, 
deletion or insertion) in at least one VT-related molecule of a subject, using the methods 
disclosed herein and if such mutation, polymorphism, or both, is identified, selecting a 
treatment to avoid venous thrombosis, delay the onset of venous thrombosis, or 

1 5 minimize its consequences. 

Also disclosed are arrays capable of rapid, cost-effective multiple genetic testing 
for venous thrombosis susceptibility. Such arrays include oligonucleotides that are 
complementary to factor V, factor II, fibrinogen, protein C, protein S, antithrombin III, 
ACE, and MTHFR wild-type or mutated sequences, or both. Kits including such arrays 

20 for detecting a genetic predisposition to venous thrombosis in a subject are also 
disclosed. 

The foregoing and other features and advantages of the disclosure will become 
more apparent from the following detailed description of a several embodiments. 



25 



SEQUENCE LISTING 
The nucleic acid sequences listed in the accompanying sequence listing are 
shown using standard letter abbreviations for nucleotide bases. Only one strand of each 
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nucleic acid sequence is shown, but the complementary strand is understood as included 
by any reference to the displayed strand. 

SEQ ID NO: 1 is an oligonucleotide sequence that can be used to probe for a 
wild-type factor V sequence at nucleotide position 1691G. 

SEQ ID NO: 2 is an oligonucleotide sequence that can be used to probe for a 
1691G/A replacement in factor V. 

SEQ ID NO: 3 is an oligonucleotide sequence that can be used to probe for a 
wild-type factor V sequence at nucleotide position 1628G. 

SEQ ID NO: 4 is an oligonucleotide sequence that can be used to probe for a 
1628G/A replacement in factor V. 

SEQ ID NO: 5 is an oligonucleotide sequence that can be used to probe for a 
wild-type factor V sequence at nucleotide position 4070A. 

SEQ ID NO: 6 is an oligonucleotide sequence that can be used to probe for a 
4070A/G replacement in factor V. 

SEQ ID NO: 7 is an oligonucleotide sequence that can be used to probe for a 
wild-type factor V sequence at nucleotide position 1090A. 

SEQ ID NO: 8 is an oligonucleotide sequence that can be used to probe for a 
1090A/G replacement in factor V. 

SEQ ID NO: 9 is an oligonucleotide sequence that can be used to probe for a 
wild-type factor V sequence at nucleotide position 1 09 1 G. 

SEQ ID NO: 10 is an oligonucleotide sequence that can be used to probe for a 
1091G/C replacement in factor V. 

SEQ ID NO: 1 1 is an oligonucleotide sequence that can be used to probe for a 
wild-type prothrombin sequence at nucleotide position 202 lOG. 

SEQ ID NO: 12 is an oligonucleotide sequence that can be used to probe for a 
20210G/A replacement in prothrombin. 



WDN:SLR 4239-66342 Express Mail No. EV33 1 582525US 

Date of Deposit: January 15, 2004 

-7- 

SEQ ID NO: 13 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position a(16)Arg: CGT. 

SEQ ID NO: 14 is an oligonucleotide sequence that can be used to probe for a 
a(16)Arg/Cys: CGT/TGT replacement in fibrinogen. 
5 SEQ ID NO: 1 5 is an oligonucleotide sequence that can be used to probe for a 

wild-type fibrinogen sequence at position a(16)Arg: CGT. 

SEQ ID NO: 16 is an oligonucleotide sequence that can be used to probe for a 
a(16)Arg/His: CGT/CAT replacement in fibrinogen. 

SEQ ID NO: 17 is an oligonucleotide sequence that can be used to probe for a 
10 wild-type fibrinogen sequence at position a(19)Arg: AGG. 

SEQ ID NO: 1 8 is an oligonucleotide sequence that can be used to probe for a 
a(19)Arg/Gly: AGG/GGG replacement in fibrinogen. 

SEQ ID NO: 1 9 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position a(461)Lys: AAA. 
1 5 SEQ ID NO: 20 is an oligonucleotide sequence that can be used to probe for a 

a(461)Lys/Stop: AAA/TAA replacement in fibrinogen. 

SEQ ID NO: 21 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position a(554)Arg: CGT. 

SEQ ID NO: 22 is an oligonucleotide sequence that can be used to probe for a 
20 a(554)Arg/Cys: CGT/TGT replacement in fibrinogen. 

SEQ ID NO: 23 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position P(14)Arg, CGT. 

SEQ ID NO: 24 is an oligonucleotide sequence that can be used to probe for a 
|3(14)Arg/Cys: CGT/TGT replacement in fibrinogen. 
25 SEQ ID NO: 25 is an oligonucleotide sequence that can be used to probe for a 

wild-type fibrinogen sequence at position P(68)Ala, GCT. 
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SEQ ID NO: 26 is an oligonucleotide sequence that can be used to probe for a 
p(68)Ala/Thr: GCT/ACT replacement in fibrinogen. 

SEQ ID NO: 27 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position P(255)Arg, CGT. 
5 SEQ ID NO: 28 is an oligonucleotide sequence that can be used to probe for a 

p(255)Arg/Cys: CGT/TGT replacement in fibrinogen. 

SEQ ID NO: 29 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position Y(275)Arg, CGC. 

SEQ ID NO: 30 is an oligonucleotide sequence that can be used to probe for a 
1 0 Y(275)Arg/Cys: CGC/TGC replacement in fibrinogen. 

SEQ ID-NO: 31 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position Y(275)Arg, CGC. 

SEQ ID NO: 32 is an oligonucleotide sequence that can be used to probe for a 
Y(275)Arg/His: CGC/CAC replacement in fibrinogen. 
1 5 SEQ ID NO: 33 is an oligonucleotide sequence that can be used to probe for a 

wild-type fibrinogen sequence at position Y(292)Gly, GGC. 

SEQ ID NO: 34 is an oligonucleotide sequence that can be used to probe for a 
Y(292)GlyA^al: GGC/GTC replacement in fibrinogen. 

SEQ ID NO: 35 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type fibrinogen sequence at position Y(308)Asn, AAT. 

SEQ ID NO: 36 is an oligonucleotide sequence that can be used to probe for a 
Y(308)Asn/Lys: AAT/AAG replacement in fibrinogen. 

SEQ ID NO: 37 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position Y(318)Asp, GAC. 
25 SEQ ID NO: 38 is an oligonucleotide sequence that can be used to probe for a 

Y(318)Asp/Gly: GAC/GGC replacement in fibrinogen. 

SEQ ID NO: 39 is an oligonucleotide sequence that can be used to probe for a 
wild-type fibrinogen sequence at position Thr312, ACT. 
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SEQ ID NO: 40 is an oligonucleotide sequence that can be used to probe for a 
Thr312Ala: ACT/GCT replacement in fibrinogen. 

SEQ ID NO: 41 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 41G. 

SEQ ID NO: 42 is an oligonucleotide sequence that can be used to probe for a 
41G/A replacement in protein C. 

SEQ ID NO: 43 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 1357C. 

SEQ ID NO: 44 is an oligonucleotide sequence that can be used to probe for a 
1357 C/T replacement in protein C. 

SEQ ID NO: 45 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 1381C. 

SEQ ID NO: 46 is an oligonucleotide sequence that can be used to probe for a 
1381 C/T replacement in protein C. 

SEQ ID NO: 47 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 31 03C. 

SEQ ID NO: 48 is an oligonucleotide sequence that can be used to probe for a 
3103 C/T replacement in protein C. 

SEQ ID NO: 49 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 3 1 69 T. 

SEQ ID NO: 50 is an oligonucleotide sequence that can be used to probe for a 
3169 T/C replacement in protein C. 

SEQ ID NO: 51 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 32 1 7 G. 

SEQ ID NO: 52 is an oligonucleotide sequence that can be used to probe for a 
3217 G/T replacement in protein C. 

SEQ ID NO: 53 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 3222 G. 
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SEQ ID NO: 54 is an oligonucleotide sequence that can be used to probe for a 
3222 G/A replacement in protein C. 

SEQ ID NO: 55 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 3222 G. 
5 SEQ ID NO: 56 is an oligonucleotide sequence that can be used to probe for a 

3222 G/T replacement in protein C. 

SEQ ID NO: 57 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 3359G. 

SEQ ID NO: 58 is an oUgonucleotide sequence that can be used to probe for a 
1 0 3359 G/A replacement in protein C. 

SEQ ID NO: 59 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 3360 C. 

SEQ ID NO: 60 is an oligonucleotide sequence that can be used to probe for a 
3360 C/A replacement in protein C. 
1 5 SEQ ID NO: 61 is an oHgonucleotide sequence that can be used to probe for a 

wild-type protein C sequence at nucleotide position 3363/3364 in protein C. 

SEQ ID NO: 62 is an oligonucleotide sequence that can be used to probe for a 
3363/4 insC in protein C. 

SEQ ID-NO: 63 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein C sequence at nucleotide position 3439 C. 

SEQ ID NO: 64 is an oUgonucleotide sequence that can be used to probe for a 
3439 C/T replacement in protein C. 

SEQ ID NO: 65 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6128 T. 
25 SEQ ID NO: 66 is an oligonucleotide sequence that can be used to probe for a 

6128 T/C replacement in protein C. 

SEQ ID NO: 67 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6152 C. 
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SEQ ID NO: 68 is an oligonucleotide sequence that can be used to probe for a 
6152 C/T replacement in protein C. 

SEQ ID NO: 69 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6182 C. 

SEQ ID NO: 70 is an oligonucleotide sequence that can be used to probe for a 
6 1 82 C/T replacement in protein C. 

SEQ ID NO: 71 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6216 C. 

SEQ ID NO: 72 is an oligonucleotide sequence that can be used to probe for a 
62 1 6 C/T replacement in protein C. 

SEQ ID NO: 73 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6245 C. 

SEQ ID NO: 74 is an oligonucleotide sequence that can be used to probe for a 

6245 C/T replacement in protein C. 

SEQ ID NO: 75 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6246 G. 

SEQ ID NO: 76 is an oligonucleotide sequence that can be used to probe for a 

6246 G/A replacement in protein C. 

SEQ ID NO: 77 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6265 G. 

SEQ ID NO: 78 is an oligonucleotide sequence that can be used to probe for a 
6265 G/C replacement in protein C. 

SEQ ID NO: 79 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 6274 C. 

SEQ ID NO: 80 is an oligonucleotide sequence that can be used to probe for a 
6274 C/T replacement in protein C. 

SEQ ID NO: 81 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 7176 G. 
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SEQ ID NO: 82 is an oligonucleotide sequence that can be used to probe for a 
7176 G/A replacement in protein C. 

SEQ ID NO: 83 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 7253 C. 
5 SEQ ID NO: 84 is an oligonucleotide sequence that can be used to probe for a 

7253 C/T replacement in protein C. 

SEQ ID NO: 85 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8403 C. 

SEQ ID NO: 86 is an oligonucleotide sequence that can be used to probe for a 
10 8403 C/T replacement in protein C. 

SEQ ID NO: 87 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8481 A. 

SEQ ID NO: 88 is an oligonucleotide sequence that can be used to probe for a 
8481 A/G replacement in protein C. 
1 5 SEQ ID NO: 89 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein C sequence at nucleotide position 8485/6. 

SEQ ID NO: 90 is an oligonucleotide sequence that can be used to probe for a 
8485/6 delAC or 8486/7 delCA in protein C. 

SEQ ID NO: 91 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein C sequence at nucleotide position 8551 C. 

SEQ ID NO: 92 is an oligonucleotide sequence that can be used to probe for a 
8551 C/T replacement in protein C. 

SEQ ID NO: 93 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8559 G. 
25 SEQ ID NO: 94 is an oligonucleotide sequence that can be used to probe for a 

8559 G/A replacement in protein C. 

SEQ ID NO: 95 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8571 C. 
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SEQ ID NO: 96 is an oligonucleotide sequence that can be used to probe for a 

8571 C/T replacement in protein C. 

SEQ ID NO: 97 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8572 G. 

SEQ ID NO: 98 is an oligonucleotide sequence that can be used to probe for a 

8572 G/A replacement in protein C. 

SEQ ID NO: 99 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8589 G. 

SEQ ID NO: 100 is an oligonucleotide sequence that can be used to probe for 
8589 G/A replacement in protein C. 

SEQ ID-NO: 101 is an oligonucleotide sequence that can be used to probe for 
wild-type protein C sequence at nucleotide position 8604 G. 

SEQ ID NO: 102 is an oligonucleotide sequence that can be used to probe for 
8604 G/A replacement in protein C. 

SEQ ID NO: 103 is an oligonucleotide sequence that can be used to probe for 
wild-type protein C sequence at nucleotide position 8608 C. 

SEQ ID NO: 104 is an oligonucleotide sequence that can be used to probe for 
8608 C/T replacement in protein C. 

SEQ ID NO: 105 is an oligonucleotide sequence that can be used to probe for 
wild-type protein C sequence at nucleotide position 8631 C. 

SEQ ID NO: 106 is an oligonucleotide sequence that can be used to probe for 
8631 C/T replacement in protein C. 

SEQ ID NO: 107 is an oligonucleotide sequence that can be used to probe for 
wild-type protem C sequence at nucleotide position 8678-80. 

SEQ ID NO: 108 is an oligonucleotide sequence that can be used to probe for 
8678-80 del3nt in protein C. 

SEQ ID NO: 109 is an oligonucleotide sequence that can be used to probe for 
wild-type protein C sequence at nucleotide position 8689 T. 
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SEQ ID NO: 1 10 is an oligonucleotide sequence that can be used to probe for a 
8689 T/C replacement in protein C. 

SEQ ID NO: 1 1 1 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8695 C. 
5 SEQ ID NO: 1 12 is an oligonucleotide sequence that can be used to probe for a 

8695 C/T replacement in protein C. 

SEQ ID NO: 113 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8763 G. 

SEQ ID NO: 1 14 is an oligonucleotide sequence that can be used to probe for a 
10 8763 G/A replacement in protein C. 

SEQ ID NO: 1 1 5 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8857. 

SEQ ID NO: 1 16 is an oligonucleotide sequence that can be used to probe for a 
8857 del G in protein C. 
1 5 SEQ ID NO: 1 1 7 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein C sequence at nucleotide position 8895 A. 

SEQ ID NO: 11 8 is an oligonucleotide sequence that can be used to probe for a 
8895 A/C replacement in protein C. 

SEQ ID NO: 1 19 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein C sequence at nucleotide position 8924 C. 

SEQ ID NO: 120 is an oligonucleotide sequence that can be used to probe for a 
8924 C/G replacement in protein C. 

SEQ ID NO: 121 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 1387 C. 
25 SEQ ID NO: 122 is an oligonucleotide sequence that can be used to probe for a 

1387 C/T replacement in protein C. 

SEQ ID NO: 123 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 1388 G. 
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SEQ ID NO: 124 is an oligonucleotide sequence that can be used to probe for j 
1388 G/A replacement in protein C. 

SEQ ID NO: 125 is an oligonucleotide sequence that cin be used to probe for i 
wild-type protein C sequence at nucleotide position 1432 C. 

SEQ ID NO: 126 is an oligonucleotide sequence that can be used to probe for i 
1432 C/T replacement in protein C. 

SEQ ID NO: 127 is an oligonucleotide sequence that can be used to probe for i 
wild-type protein C sequence at nucleotide position 6218 C. 

SEQ ID NO: 128 is an oligonucleotide sequence that can be used to probe for i 

6218 C/T replacement in protein C. 

SEQ ID NO: 129 is an oligonucleotide sequence that can be used to probe for i 
wild-type protein C sequence at nucleotide position 6219 G. 

SEQ ID NO: 130 is an oligonucleotide sequence that can be used to probe for i 

6219 G/A replacement in protein C. 

SEQ ID NO: 13 1 is an oligonucleotide sequence that can be used to probe for i 
wild-type protein C sequence at nucleotide position 7219 C. 

SEQ ID NO: 1 32 is an oligonucleotide sequence that can be used to probe for i 
7219 C/A replacement in protein C. 

SEQ ID NO: 133 is an oligonucleotide sequence that can be used to probe for i 
wild-type protein C sequence at nucleotide position 8470 G. 

SEQ ID NO: 134 is an oligonucleotide sequence that can be used to probe for i 
8470 G/A replacement in protein C. 

SEQ ID NO: 135 is an oligonucleotide sequence that can be used to probe for < 
wild-type protein C sequence at nucleotide position 8744 G. 

SEQ ID NO: 136 is an oligonucleotide sequence that can be used to probe for i 
8744 G/A replacement in protein C. 

SEQ ID NO: 137 is an oligonucleotide sequence that can be used to probe for j 
wild-type protein C sequence at nucleotide position 8769 C. 
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SEQ ID NO: 138 is an oligonucleotide sequence that can be used to probe for a 
8769 C/T replacement in protein C. 

SEQ ID NO: 139 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8790 G. 
5 SEQ ID'NO: 140 is an oligonucleotide sequence that can be used to probe for a 

8790 G/A replacement in protein C. 

SEQ ID NO: 141 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position 8886 G. 

SEQ ID NO: 142 is an oligonucleotide sequence that can be used to probe for a 
10 8886 G/A replacement in protein C. 

SEQ ID NO: 143 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein C sequence at nucleotide position -1654 C. 

SEQ ID NO: 144 is an oligonucleotide sequence that can be used to probe for a 
1654 C/T replacement in protein C. 
1 5 SEQ ID NO: 145 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein C sequence at nucleotide position - 1 64 1 A. 

SEQ ID NO: 146 is an oligonucleotide sequence that can be used to probe for a 
1641 A/G replacement in protein C. 

SEQ ID NO: 147 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein S sequence at position -34 TGC. 

SEQ ID NO: 148 is an oligonucleotide sequence that can be used to probe for a 
34, deIG in protein S. 

SEQ ID NO: 149 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at -24 GTG (where 24 is the codon position). 
25 SEQ ID NO: 1 50 is an oligonucleotide sequence that can be used to probe for a 

24 GTG/GAG replacement in protein S. 

SEQ ID NO: 1 5 1 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 19 GAA. 
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SEQ ID NO: 152 is an oligonucleotide sequence that can be used to probe for a 
19 GAA/TAA replacement in protein S. 

SEQ ID NO: 153 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 26, GAA. 
5 SEQ ID NO: 1 54 is an oligonucleotide sequence that can be used to probe for a 

26, GAA/GCA replacement in protein S. 

SEQ ID NO: 155 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 44. 

SEQ ID NO: 1 56 is an oligonucleotide sequence that can be used to probe for a 
10 44, delCTTA in protein S. 

SEQ ID NO: 157 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 46 GTT. 

SEQ ID NO: 1 58 is an oligonucleotide sequence that can be used to probe for a 
46 GTT/CTT replacement in protein S. 
1 5 SEQ ID NO: 1 59 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein S sequence at position intron d, exon 4, +1. 

SEQ ID NO: 160 is an oligonucleotide sequence that can be used to probe for a 
intron d, G/A, exon 4, +1 replacement in protein S. 

SEQ ID NO: 161 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein S sequence at position 1 55 AAG. 

SEQ ID NO: 162 is an oligonucleotide sequence that can be used to probe for a 
155 AAG/GAG replacement in protein S. 

SEQ ID NO: 163 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 217 AAT. 
25 SEQ ID NO: 1 64 is an oligonucleotide sequence that can be used to probe for a 

217 AAT/AGT replacement in protein S. 

SEQ ID NO: 165 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 238 CAG. 
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SEQ ID NO: 166 is an oligonucleotide sequence that can be used to probe for a 
238 CAG/TAG replacement in protein S. 

SEQ ID NO: 167 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 265, 

SEQ ID NO: 168 is an oligonucleotide sequence that can be used to probe for a 
265, ins T insertion in protein S. 

SEQ ID NO: 169 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 293 TCA. 

SEQ ID NO: 170 is an oligonucleotide sequence that can be used to probe for a 
293 TCA/TGA replacement in protein S. 

SEQ ID NO: 171 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 295 GGC. 

SEQ ID NO: 172 is an oligonucleotide sequence that can be used to probe for a 
295 GGC/GTC replacement in protein S. 

SEQ ID, NO: 173 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position intron j, exon 10, +5. 

SEQ ID NO: 1 74 is an oligonucleotide sequence that can be used to probe for a 
intron j, G/A, exon 10, +5 replacement in protein S. 

SEQ ID NO: 175 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 349 GAA. 

SEQ ID NO: 176 is an oligonucleotide sequence that can be used to probe for a 
349 GAA/AAA replacement in protein S. 

SEQ ID NO: 177 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 372. 

SEQ ID NO: 178 is an oligonucleotide sequence that can be used to probe for a 
372 delCTTTTT, insAA in protein S. 

SEQ ID NO: 179 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position intron k. A, exon 12, -9. 
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SEQ ID NO: 180 is an oligonucleotide sequence that can be used to probe for a 
intron k, A/G, exon 12, -9 replacement in protein S. 

SEQ ID NO: 1 81 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 405 CTA. 
5 SEQ ID NO: 1 82 is an oligonucleotide sequence that can be used to probe for a 

405 CTA/CCA replacement in protein S. 

SEQ ID NO: 1 83 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 410 CGA. 

SEQ ID NO: 184 is an oligonucleotide sequence that can be used to probe for a 
10 410 CGA/TGA replacement in protein S. 

SEQ ID NO: 1 85 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 43 1 . 

SEQ ID NO: 186 is an oligonucleotide sequence that can be used to probe for a 
431 ins A in protein S. 

1 5 SEQ ID NO: 1 87 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein S sequence at nucleotide position 465 TGG. . 

SEQ ID NO: 1 88 is an oligonucleotide sequence that can be used to probe for a 
465 TGG/TGA replacement in protein S. 

SEQ ID NO: 189 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein S sequence at position 474 CGT. 

SEQ ID NO: 190 is an oligonucleotide sequence that can be used to probe for a 
474 CGT/TGT replacement in protein S. 

SEQ ID NO: 191 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 522 CAG. 
25 SEQ ID NO: 1 92 is an oligonucleotide sequence that can be used to probe for a 

522 C AG/TAG replacement in protein S. 

SEQ ID NO: 193 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 534 CTG. 
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SEQ ID NO: 194 is an oligonucleotide sequence that can be used to probe for 
534 CTG/CGG replacement in protein S. 

SEQ ID NO: 195 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position 625 TOT. 

SEQ ID NO: 196 is an oligonucleotide sequence that can be used to probe for 
625 TGT/CGT replacement in protein S. 

SEQ ID NO: 197 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position -2 CGT. 

SEQ ID NO: 198 is an oligonucleotide sequence that can be used to probe for 
2, CGT/CTT replacement in protein S. 

SEQ ID NO: 199 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position 9 AAA. 

SEQ ID NO: 200 is an oligonucleotide sequence that can be used to probe for 
9 AAA/GAA replacement in protein S. 

SEQ ID NO: 201 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position intron e, G, exon 5, +5. 

SEQ ID NO: 202 is an oligonucleotide sequence that can be used to probe for 
intron e, G/A, exon 5, +5 replacement in protein S. 

SEQ ID NO: 203 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position -25. 

SEQ ID NO: 204 is an oligonucleotide sequence that can be used to probe for 
25, insT in protein S. 

SEQ ID NO: 205 is an oligonucleotide sequence that can be used to probe for 
wild-type protein S sequence at position 467 GTA. 

SEQ ID NO: 206 is an oligonucleotide sequence that can be used to probe for ; 
467 GTA/GGA replacement in protein S. 

SEQ ID NO: 207 is an oligonucleotide sequence that can be used to probe for ; 
wild-type protein S sequence at position 633. 
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SEQ ID NO: 208 is an oligonucleotide sequence that can be used to probe for a 
633 delAA in protein S. 

SEQ ID NO: 209 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 636 TAA. 
5 SEQ ID NO: 2 1 0 is an oligonucleotide sequence that can be used to probe for a 

636 TAA/TAT replacement in protein S. 

SEQ ID NO: 211 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 35 CCG. 

SEQ ID NO: 212 is an oligonucleotide sequence that can be used to probe for a 
1 0 35 CCG/CTG replacement in protein S. 

SEQ ID NO: 213 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position intron b, exon 2+5. 

SEQ ID NO: 214 is an oUgonucleotide sequence that can be used to probe for a 
intron b, G/A, exon 2 +5 replacement in protein S. 
1 5 SEQ ID NO: 2 1 5 is an oligonucleotide sequence that can be used to probe for a 

wild-type protein S sequence at position 303 ATC. 

SEQ ID NO: 216 is an oligonucleotide sequence that can be used to probe for a 
303 ATC/ATT replacement in protein S. 

SEQ ID NO: 217 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type protein S sequence at position intron k, exon 1 1 +54. 

SEQ ID NO: 218 is an oligonucleotide sequence that can be used to probe for a 
intron k, C/T, exon 1 1 +54 replacement in protein S. 

SEQ ID NO: 219 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 460 TCC. 
25 SEQ ID NO: 220 is an oligonucleotide sequence that can be used to probe for a 

460 TCC/CCC replacement in protein S. 

SEQ ID NO: 221 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position 626 CCA. 
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SEQ ID NO: 222 is an oligonucleotide sequence that can be used to probe for a 
626 CCA/CCG replacement in protein S. 

SEQ ID NO: 223 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position exon 15, T 18 nt after the stop codon. 
5 SEQ ID NO: 224 is an oligonucleotide sequence that can be used to probe for a 

exon 15, T/G 18 nt after the stop codon replacement in protein S. 

SEQ ID NO: 225 is an oligonucleotide sequence that can be used to probe for a 
wild-type protein S sequence at position exon 15, 520 nt after the stop codon. 

SEQ ID NO: 226 is an oligonucleotide sequence that can be used to probe for a 
10 exon 15, C/A 520 nt after the stop codon replacement in protein S. 

SEQ ID NO: 227 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 2770. 

SEQ ID NO: 228 is an oligonucleotide sequence that can be used to probe for a 
2770 insT in antithrombin III. 
1 5 SEQ ID NO: 229 is an oligonucleotide sequence that can be used to probe for a 

wild-type antithrombin III sequence at nucleotide positions 531 1-5320. 

SEQ ID NO: 230 is an oligonucleotide sequence that can be used to probe for a 
53 1 1-5320 del6bp in antithrombin III. 

SEQ ID NO: 23 1 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type antithrombin III sequence at nucleotide positions 5356-5364. 

SEQ ID NO: 232 is an oligonucleotide sequence that can be used to probe for a 
5356-5364 delCTT in antithrombin III. 

SEQ ID NO: 233 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 5381 C. 
25 SEQ ID NO: 234 is an oligonucleotide sequence that can be used to probe for a 

5381C/T replacement in antithrombin III. 

SEQ ID NO: 235 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 5390 C. 
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SEQ ID NO: 236 is an oligonucleotide sequence that can be used to probe for a 
5390 C/T replacement in antithrotnbin III. 

SEQ ID NO: 237 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 5493 A. 
5 SEQ ID NO: 238 is an oligonucleotide sequence that can be used to probe for a 

5493 A/G replacement in antithrombin III. 

SEQ ID NO: 239 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 6490 C. 

SEQ ID NO: 240 is an oligonucleotide sequence that can be used to probe for a 
1 0 6490 C/T replacement in antithrombin III. 

SEQ ID NO: 241 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 9788 G. 

SEQ ID NO: 242 is an oligonucleotide sequence that can be used to probe for a 
9788 G/A replacement in antithrombin III. 
1 5 SEQ ID NO: 243 is an oligonucleotide sequence that can be used to probe for a 

wild-type antithrombin III sequence at nucleotide position 9819 C. 

SEQ ID NO: 244 is an oligonucleotide sequence that can be used to probe for a 
9819 C/T replacement in antithrombin III. 

SEQ ID NO: 245 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type antithrombin III sequence at nucleotide position 1 3342. 

SEQ ID NO: 246 is an oligonucleotide sequence that can be used to probe for a 
13342 insA in antithrombin III. 

SEQ ID NO: 247 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13380 T. 
25 SEQ ID NO: 248 is an oligonucleotide sequence that can be used to probe for a 

13380 T/C replacement in antithrombin III. 

SEQ ID NO: 249 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 6460 A. 
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SEQ ID NO: 250 is an oligonucleotide sequence that can be used to probe for a 
6460 A/G replacement in antithrombin III. 

SEQ ID NO: 25 1 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13262 G. 
5 SEQ ID NO: 252 is an oligonucleotide sequence that can be used to probe for a 

13262 G/A replacement in antithrombin III. 

SEQ ID NO: 253 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13268 G. 

SEQ ID NO: 254 is an oligonucleotide sequence that c^ be used to probe for a 
10 1 3268 G/C replacement in antithrombin III. 

SEQ ID NO: 255 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13268 G. 

SEQ ID NO: 256 is an oligonucleotide sequence that can be used to probe for a 
13268 G/T replacement in antithrombin III. 
1 5 SEQ ID NO: 257 is an oligonucleotide sequence that can be used to probe for a 

wild-type antithrombin III sequence at nucleotide position 13295 C. 

SEQ ID NO: 258 is an oligonucleotide sequence that can be used to probe for a 

13295 C/T replacement in antithrombin III. 

SEQ ID NO: 259 is an oligonucleotide sequence that can be used to probe for a 
20 wild-type antithrombin III sequence at nucleotide position 1 3296 G. 

SEQ ID NO: 260 is an oligonucleotide sequence that can be used to probe for a 

13296 G/A replacement in antithrombin III. 

SEQ ID NO: 261 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13299 C. 
25 SEQ ID NO: 262 is an oligonucleotide sequence that can be used to probe for a 

13299 C/T replacement in antithrombin III. 

SEQ ID NO: 263 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 2484 T. 
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SEQ ID NO: 264 is an oligonucleotide sequence that can be used to probe for i 
2484 T/A replacement in antithrombin III. 

SEQ ID NO: 265 is an oligonucleotide sequence that can be used to probe for i 
wild-type antithrombin III sequence at nucleotide position 2586 C. 

SEQ ID NO: 266 is an oligonucleotide sequence that can be used to probe for i 
2586 C/T replacement in antithrombin III. 

SEQ ID NO: 267 is an oligonucleotide sequence that can be used to probe for i 
wild-type antithrombin III sequence at nucleotide position 2603 C. 

SEQ ID NO: 268 is an oligonucleotide sequence that can be used to probe for i 

2603 C/T replacement in antithrombin III. 

SEQ ID NO: 269 is an oligonucleotide sequence that can be used to probe for j 
wild-type antithrombin III sequence at nucleotide position 2604 G. 

SEQ ID NO: 270 is an oligonucleotide sequence that can be used to probe for i 

2604 G/A replacement in antithrombin III. 

SEQ ID NO: 271 is an oligonucleotide sequence that can be used to probe for i 
wild-type antithrombin III sequence at nucleotide position 2759 C. 

SEQ ID NO: 272 is an oligonucleotide sequence that can be used to probe for s 
2759 C/T replacement in antithrombin III. 

SEQ ID NO: 273 is an oligonucleotide sequence that can be used to probe for •< 
wild-type antithrombin III sequence at nucleotide position 5382 G. 

SEQ ID NO: 274 is an oligonucleotide sequence that can be used to probe for ; 
5382 G/A replacement in antithrombin III. 

SEQ ID NO: 275 is an oligonucleotide sequence that can be used to probe for i 
wild-type antithrombin III sequence at nucleotide position 13324 C. 

SEQ ID NO: 276 is an oligonucleotide sequence that can be used to probe for i 
13324 C/A replacement in antithrombin III. 

SEQ ID NO: 277 is an oligonucleotide sequence that can be used to probe for i 
wild-type antithrombin III sequence at nucleotide position 13328 G. 



WDN:SLR 4239-66342 Express Mail No. EV33 1 582525US 

Date of Deposit: January 15, 2004 

-26- 

SEQ ID NO: 278 is an oligonucleotide sequence that can be used to probe for a 
13328 G/A replacement in antithrombin III. 

SEQ ID NO: 279 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13333 C. 
5 SEQ ID NO: 280 is an oligonucleotide sequence that can be used to probe for a 

13333 C/G replacement in antithrombin III. 

SEQ ID NO: 281is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13337 C. 

SEQ ID NO: 282 is an oligonucleotide sequence that can be used to probe for a 
10 13337 C/A replacement in antithrombin III. 

SEQ ID NO: 283 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 13338 C. 

SEQ ID NO: 284 is an oligonucleotide sequence that can be used to probe for a 
13338 C/T replacement in antithrombin III. 
1 5 SEQ ID NO: 285 is an oligonucleotide sequence that can be used to probe for a 

wild-type antithrombin III sequence at nucleotide position 13392 G. 

SEQ ID NO: 286 is an oligonucleotide sequence that can be used to probe for a 
13392 G/C replacement in antithrombin III. 

SEQ ID NO: 287 is an oligonucleotide sequence that can be used to probe for a 
20 1 08 bp allele of an antithrombin III sequence, to determine the presence of a wild-type 
sequence. 

SEQ ID'NO: 288 is an oligonucleotide sequence that can be used to probe for a 
32 bp allele of an antithrombin III, to determine the presence of a 76 bp dimorphism. 

SEQ ID NO: 289 is an oligonucleotide sequence that can be used to probe for a 
25 wild-type antithrombin III sequence at nucleotide position 7596 G. 

SEQ ID NO: 290 is an oligonucleotide sequence that can be used to probe for a 
7596 G/A replacement in antithrombin III. 
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SEQ ID NO: 291 is an oligonucleotide sequence that can be used to probe for a 
wild-type antithrombin III sequence at nucleotide position 7626 G. 

SEQ ID NO: 292 is an oligonucleotide sequence that can be used to probe for a 
7626 G/A replacement in antithrombin III. 
5 SEQ ID NO: 293 is an oligonucleotide sequence that can be used to probe for a 

wild-type antithrombin III sequence at nucleotide position 7987 T. 

SEQ ID NO: 294 is an oligonucleotide sequence that can be used to probe for a 
7987 T/C replacement in antithrombin III. 

SEQ ID NO: 295 is an oligonucleotide sequence that can be used to probe for a 
1 0 wild-type antithrombin III sequence at nucleotide position 9893 C. 

SEQ ID NO: 296 is an oligonucleotide sequence that can be used to probe for a 
9893 C/G replacement in antithrombin III. 

SEQ ID NO: 297 is an oligonucleotide sequence that can be used to probe for a 
wild-type ACE sequence with 288 bp insertion in intron 16. 
1 5 SEQ ID NO: 298 is an oligonucleotide sequence that can be used to probe for a 

mutant type ACE sequence with 288 bp deletion in intron 16. 

SEQ ID NO: 299 is an oligonucleotide sequence that can be used to probe for a 
wild-type MTHFR sequence at nucleotide position 677 C. 

SEQ ID NO: 300 is an oligonucleotide sequence that can be used to probe for a 
20 677 C/T replacement in MTHFR. 

SEQ ID NO: 301 is an oligonucleotide sequence that can be used to probe for a 
wild-type MTHFR sequence at nucleotide position 1298 A. 

SEQ ID NO: 302 is an oligonucleotide sequence that can be used to probe for a 
1298 A/C replacement in MTHFR. 



25 
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DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS 
Abbreviations and Terms 

The following explanations of terms and methods are provided to better describe 
the present disclosure and to guide those of ordinary skill in the art in the practice of the 
present disclosure. The singular forms "a," "an," and "the" refer to one or more than 
one, unless the context clearly dictates otherwise. For example, the term "comprising a 
nucleic acid" includes single or plural nucleic acids and is considered equivalent to the 
phrase "comprising at least one nucleic acid." The term "or" refers to a single element 
of stated alternative elements or a combination of two or more elements, unless the 
context clearly indicates otherwise. As used herein, "comprises" means "includes." 
Thus, "comprising A or B," means "including A, B, or A and B," without excluding 
additional elements. 

Unless explained otherwise, all technical and scientific terms used herein have 
the same meaning as commonly understood to one of ordinary skill in the art to which 
this disclosure belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present disclosure, suitable 
methods and materials are described below. The materials, methods, and examples are 
illustrative only and not intended to be limiting. 

Africans: A human racial classification that includes persons having origins in 
any of the black racial groups of Africa. Includes dark-skinned persons who are natives 
or inhabitants of Africa, as well as persons of African descent, such as African- 
Americans. Such persons also retain substantial genetic similarity to natives or 
inhabitants of Africa. In a particular example, an African is at least 1/64 African. 

Amplifying a nucleic acid molecule: To increase the number of copies of a 
nucleic acid molecule, such as a gene or fragment of a gene, for example a region of a 
venous thrombosis (VT)-associated gene. The resulting amplification products are 
called amplification products. 
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An example of /« vitro amplification is the polymerase chain reaction (PCR), in 
which a biological sample obtained from a subject is contacted with a pair of 
oligonucleotide primers, under conditions that allow for hybridization of the primers to 
a nucleic acid molecule in the sample. The primers are extended under suitable 
5 conditions, dissociated from the template, and then re-annealed, extended, and 
dissociated to amplify the number of copies of the nucleic acid molecule. Other 
examples of m vitro amplification techniques include quantitative real-time PCR, strand 
displacement amplification (see USPN 5,744,31 1); transcription-fi-ee isothermal 
amplification (see USPN 6,033,881); repair chain reaction amplification (see WO 
10 90/01069); ligase chain reaction amplification (see EP-A-320 308); gap filling ligase 
chain reaction amplification (see USPN 5,427,930); coupled ligase detection and PCR 
(see USPN 6,027,889); and NASBA™ RNA transcription-fi-ee amplification (see USPN 
6,025,134). . 

Angiotensin I-converting enzyme (ACE): An enzyme that converts 
1 5 angiotensin I into the vasoconstrictor angiotensin II, and is involved in the degradation 
of bradykinin. Includes any ACE gene, cDNA, RNA, or protein from any organism, 
such as a human. Examples include the sequence disclosed in GenBank Accession No. 
BC048144 (as well as the corresponding genomic and protein sequence). 

At least one variation in human ACE is associated with venous thrombosis: a 
20 polymorphism consisting of an insertion (ins) or deletion (del) of a 288-bp fragment in 
intron 16. 

Anticoagulants: Agents that decrease or prevent abnormal blood clotting. 
Anticoagulants can avoid the formation of new clots, and prevent existing clots from 
growing (extending), for example by decreasing or stopping the production of proteins 
25 necessary for blood to clot. Examples include, but are not limited to, aspirin, heparin, 
and warfarin (Coumadin). 

Antithrombin III (AT III): A member of the serpin (serine proteinase 
inhibitor) superfamily of proteins. AT III is the principal thrombin inhibitor, and has 



WDN:SLR 4239-66342 Express Mail No. EV33 1582525US 

Date of.Deposit: January 15, 2004 

-30- 

inhibitory effects on other coagulation factors, such as factors IXa, Xa, XIa and Xlla. In 
addition, AT III accelerates the dissociation of the factor Vila-tissue factor complex and 
prevents its rebinding. Includes the product of any AT III gene, cDNA, or RNA, or an 
AT III protein from any organism, such as a human. Examples include the mRNA 
5 sequence disclosed in GenBank Accession No. NM_000488 (as well as the 

corresponding genomic and protein sequence). The gene coding for human AT III is 
localized on chromosome lq23-25, spans 13.4 kb of DNA and has seven exons. 

Heterozygous AT III deficiency is associated with increased risk for venous 
thrombosis. The molecular basis of AT III deficiency is highly heterogeneous. AT III 
1 0 deficiency is divided into type I (low plasma levels of both fiinctional and 

immunological AT III) and type II (variant AT III in plasma). Type II is fiorther 
subdivided into RS (defective reactive site), HBS (defective heparin-bmding site) and 
PE (pleitropic, that is, multiple effects on fimction). 

There are at least 127 distinct defects associated with AT III deficiency: 92 
1 5 mutations for type I AT III deficiency (40 point mutations, 40 small insertions or 

deletions and 12 large deletions) and 35 mutations for type II AT III deficiency (12 RS, 
12 HBS and 1 1 PE mutations, all point mutations). Among the type I mutations, at least 
1 1 distinct mutations (7 point mutations and 4 deletions or insertions) have been 
described in multiple unrelated kindreds and the remaining mutations have been unique 
20 to single families which makes them individual mutations. In type II, 19 of the 35 
mutations (seven RS, six HBS and six PE mutations) have been described in multiple 
unrelated kindreds and the remaining have been reported to be individual mutations. 

In addition to causative AT III gene mutations in subjects with hereditary AT III 
deficiency, five distinct recurrent AT III gene polymorphisms cosegregate with the 
25 deficient phenotype in families with hereditary AT III deficiency. 

Exemplary recurreiit AT III gene mutations and polymorphisms related to 
recurrent venous thrombosis are shovwi in Table 1 . 
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Array: An arrangement of molecules, such as biological macromolecules (such 
as polypeptides or nucleic acids) or biological samples (such as tissue sections), in 
addressable locations on or in a substrate. A "microarray" is an array that is 
miniaturized so as to require or be aided by microscopic examination for evaluation or 
analysis. Arrays are sometimes called DNA chips or biochips. 

The array of molecules ("features") makes it possible to carry out a very large 
number of analyses on a sample at one time. In certain example arrays, one or more 
molecules (such as an oligonucleotide probe) will occur on the array a plurality of times 
(such as twice), for mstance to provide internal controls. The number of addressable 
locations on the array can vary, for example from a few (such as three) to at least 50, 
100, 200, 300, 500, 600, 1000, 10,000, or more. In particular examples, an array 
includes nucleic acid molecules, such as oligonucleotide sequences that are at least 15 
nucleotides in length, such as about 15-40 nucleotides in length. In particular examples, 
an array includes SEQ ID NOS: 1-302, or subsets thereof, such as even-numbered SEQ 
ID NOS: 2-302 (to detect wild-type VT-associated sequences), or odd-numbered SEQ 
ID NOS: 1 -30 1 (to detect mutant or polymorphic VT-associated sequences). 

Within an array, each arrayed sample is addressable, in that its location can be 
reliably and consistently determined within the at least two dimensions of the array. 
The feature application location on an array can assume different shapes. For example, 
the array can be regular (such as arranged in uniform rows and columns) or irregular. 
Thus, in ordered arrays the location of each sample is assigned to the sample at the time 
when it is applied to the array, and a key may be provided in order to correlate each 
location with the appropriate target or feature position. Often, ordered arrays are 
arranged in a symmetrical grid pattern, but samples could be arranged in other pattems 
(such as in radially distributed lines, spiral lines, or ordered clusters). Addressable 
arrays usually are computer readable, in that a computer can be programmed to correlate 
a particular address on the array with information about the sample at that position (such 
as hybridization or binding data, including for instance signal intensity). In some 
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examples of computer readable formats, the individual features in the array are arranged 
regularly, for instance in a Cartesian grid pattern, which can be correlated to address 
information by a computer. 

Also contemplated herein are protein-based arrays, where the probe molecules 
are or include proteins, and/or where the target molecules are or include proteins, and 
arrays including nucleic acids to which proteins/peptides are bound, or vice versa. 

Asians: A human racial classification that includes persons having origins in 
any of the original peoples of the Far East, Southeast Asia, the Indian subcontinent, or 
the Pacific Islands (such as China, India, Japan, Korea, the Philippine Islands, and 
Samoa), as well as persons of Asian descent, such as Asian- Americans. Such persons 
also retain substantial genetic similarity to natives or inhabitants of Asia. In a particular 
example, an Asian is at least 1/64 Asian. 

Binding or stable binding: An association between two substances or 
molecules, such as the hybridization of one nucleic acid molecule to another (or itself) 
and the association of an antibody with a peptide. An oligonucleotide molecule binds or 
stably binds to a target nucleic acid molecule if a sufficient amount of the 
oligonucleotide molecule forms base pairs or is hybridized to its target nucleic acid 
molecule, to permit detection of that binding. Binding can be detected by any procedure 
known to one skilled in the art, such as by physical or functional properties of the 
targetroligonucleotide complex. For example, binding can be detected fiinctionally by 
determining whether binding has an observable effect upon a biosynthetic process such 
as expression of a gene, DNA replication, transcription, translation, and the like. 

Physical methods of detecting the binding of complementary strands of nucleic 
acid molecules, include but are not limited to, such methods as DNase I or chemical 
footprinting, gel shift and affinity cleavage assays. Northern blotting, dot blotting and 
light absorption detection procedures. For example, one method involves observing a 
change in light absorption of a solution containing an oligonucleotide (or an analog) and 
a target nucleic acid at 220 to 300 nm as the temperature is slowly increased. If the 
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oligonucleotide or analog has bound to its target, there is a sudden increase in 
absorption at a characteristic temperature as the oligonucleotide (or analog) and target 
disassociate from each other, or melt. In another example, the method involves 
detecting a signal, such as a detectable label, present on one or both complementary 
5 strands. 

The binding between an oligomer and its target nucleic acid is frequently 
characterized by the temperature (T„) at which 50% of the oligomer is melted from its 
target. A higher (T J means a sfronger or more stable complex relative to a complex 
with a lower (T„). 

10 Caucasians: A htunan racial classification fraditionally distinguished by 

physical characteristics such as very light to brown skin pigmentation and sfraight to 
wavy or curly hair, which includes persons having origins in any of the original peoples 
of Europe, North Africa, or the Middle East. Popularly, the word "white" is used 

synonymously with "Caucasian" in North America. Such persons also retain substantial 

I"' 

1 5 genetic similarity to natives or inhabitants of Europe, North Africa, or the Middle East. 
In a particular example, a Caucasian is at least 1/64 Caucasian. 

cDNA (complementary DNA): A piece of DNA lacking internal, non-coding 
segments (introns) and regulatory sequences which determine transcription. cDNA can be 
synthesized by reverse transcription from messenger RNA extracted from cells. 

20 Complementarity and percentage complementarity: Molecules with 

complementary nucleic acids form a stable duplex or triplex when the strands bind, 
(hybridize), to each other by forming Watson-Crick, Hoogsteen or reverse Hoogsteen 
base pairs. Stable binding occurs when an oligonucleotide molecule remains detectably 
bound to a target nucleic acid sequence under the required conditions. 

25 Complementarity is the degree to which bases in one nucleic acid strand base 

pair with the bases in a second nucleic acid strand. Complementarity is conveniently 
described by percentage, that is, the proportion of nucleotides that form base pairs 
between two strands or within a specific region or domain of two strands. For example. 
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if 10 nucleotides of a 15-nucleotide oligonucleotide form base pairs with a targeted 
region of a DNA molecule, that oligonucleotide is said to have 66.67% complementarity 
to the region of DNA targeted. 

In the present disclosure, "sufficient complementarity" means that a sufficient 
number of base pairs exist between an oligonucleotide molecule and a target nucleic 
acid sequence (such as factor V, factor 11, fibrinogen, protein C, protein S, antithrombin 
III, ACE, and MTHFR) to achieve detectable binding. When expressed or measured by 
percentage of base pairs formed, the percentage complementarity that fulfills this goal 
can range from as little as about 50% complementarity to full (100%) complementary. 
In general, sufficient complementarity is at least about 50%, for example at least about 
75% complementarity, at least about 90% complementarity, at least about 95% 
complementarity, at least about 98% complementarity, or even at least about 100% 
complementarity. 

A thorough treatment of the qualitative and quantitative considerations involved 
in establishing binding conditions that allow one skilled in the art to design appropriate 
oligonucleotides for use under the desired conditions is provided by Beltz et al 
Methods Enzymol 100:266-285, 1983, and by Sambrook et al. (ed), Molecular 
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, 1989. 

DNA (deoxyribonucleic acid): A long chain polymer' which includes the 
genetic material of most living organisms (some viruses have genes including 
ribonucleic acid, RNA). The repeating units in DNA polymers are four different 
nucleotides, each of which includes one of the four bases, adenine, guanine, cytosine 
and thymine boxmd to a deoxyribose sugar to which a phosphate group is attached. 
Triplets of nucleotides, referred to as codons, in DNA molecules code for amino acid in 
a polypeptide. The term codon is also used for the corresponding (and complementary) 
sequences of three nucleotides in the mRNA into which the DNA sequence is 
transcribed. 
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Deletion: The removal of one or more nucleotides from a nucleic acid sequence 
(or one or more amino acids from a protein sequence), the regions on either side of the 
removed sequence being joined together. 

Factor V (FV): A protein that can act as a cofactor in the conversion of 
prothrombin to thrombin by factor Xa and a-thrombin. Includes the product of any FV 
gene, cDNA, or RNA, or a FV protein from any organism, such as a human. Examples 
of FV nucleic acid sequences include the mRNA sequence disclosed in GenBank 
Accession No. NM_000130 (as well as the corresponding genomic and protein 
sequence). 

FV circulates in the plasma as a 330-kDa single chain glycoprotein. 
Downregulation of the procoagulant activity of activated FV (FVa) is accomplished by 
activated protein C (APC)-mediated proteolysis of FVa at three different sequential 
cleavage sites. Factor V is first cleaved at Arg 506, then at Arg 306, and finally at Arg 
679. The cleavage of the peptide bond at Arg 506 is needed for the subsequent optimal 
exposure of cleavage sites at Arg 306 and Arg 679. Peptide bond cleavage at Arg 306 
accounts for the initial 70% loss of activity and the subsequent cleavage at Arg 679 is 
responsible for the loss of the remaining activity. 

At least five single nucleotide substitutions in the human FV gene are associated 
with increased thrombosis risk: 16910^ A transition that results in a Arg506Gln 
polymorphism; 1628 G -> A transition that results in a R485K polymorphism; 1091 G 
-> C transition that results in a Arg306Thr mutation; 1090 A -> G transition that results 
in a Arg306Gly mutation; and 4070 A -> G transition that resvdts in a His 1299 Arg 
polymorphism. 

Fibrinogen: A plasma protein with multiple ftmctions in blood clotting, such as 
fibrin clot formation, factor Xlll-mediated fibrin crosslinking, nonsubstrate thrombin 
binding, platelet aggregation, and fibrinolysis. Includes the product of any fibrinogen 
gene, cDNA, RNA, or a fibrinogen protein from any organism, such as a human. 
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Examples of fibrinogen nucleic acid sequences include the mRNA sequences disclosed 

in GenBank Accession Nos. NM_021871.1, BC007030, and NM_02 1870 (for the a, p, 
and y) subunits respectively, as well as the corresponding genomic and protein 
sequences). 

5 Human fibrinogen is a 340-kDa glycoprotein, composed of two identical 

subunits linked through a disulfide bond. Each subunit includes three polypeptide 
chains (a, p, and y), which are encoded by three separate genes on the long arm of 
human chromosome 4. Dysfibrinogenemia is caused by a variety of structural 
abnormalities in the fibrinogen molecule that result in abnormal fibrinogen fimction. 

1 0 At least 25 single fibrinogen mutations (22 single nucleotide substitutions, one 

insertion and two deletions) are associated with increased thrombosis risk, and include 
the Thr3 12Ala polymorphism. At least thirteen mutations have been described in 
multiple reports from different unrelated kindreds and the remaining mutations have 
been unique to single families which makes them individual mutations. 

1 5 Exemplary recurrent thrombophilic fibrinogen gene mutations and one common 

polymorphism related to venous thrombosis are shown in Table 1. 

Genetic predisposition: Susceptibility of a subject to a genetic disease, such as 
venous thrombosis. However, such a susceptibility may or may not resuh in actual 
development of the disease. 

20 Hybridization: To form base pairs between complementary regions of two 

strands of DNA, RNA, or between DNA and RNA, thereby forming a duplex molecule. 
Hybridization conditions resulting in particular degrees of stringency will vary 
depending upon the nature of the hybridization method and the composition and length 
of the hybridizing nucleic acid sequences. Generally, the temperature of hybridization 

25 and the ionic strength (such as the Na+ concentration) of the hybridization buffer will 
determine the stringency of hybridization. Calculations regarding hybridization 
conditions for attaining particular degrees of stringency are discussed in Sambrook et 
al, (1989) Molecular Cloning, second edition. Cold Spring Harbor Laboratory, 
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Plainview, NY (chapters 9 and 1 1). The following is an exemplary set of hybridization 
conditions and is not limiting: 

Very High Stringency (detects sequences that share 90% identity) 

Hybridization: 5x SSC at eS'C for 1 6 hours 

5 Wash twice: 2x SSC at room temperature (RT) for 1 5 minutes each 

Wash twice: 0.5x SSC at 65°C for 20 minutes each 



High Stringency (detects sequences that share 80% identity or greater) 
Hybridization: 5x-6x SSC at 65°C-70°C for 1 6-20 hours 

10 Wash twice: 2x SSC at RT for 5-20 minutes each 

Wash twice: Ix SSC at 55*'C-70°C for 30 miiiutes each 



Low Strii^ency (detects sequences that share greater than 50% identity) 
Hybridization: 6x SSC at RT to 55''C for 1 6-20 hours 

1 5 Wash at least twice: 2x-3x SSC at RT to 55°C for 20-30 minutes each. 

Insertion: The addition of one or more nucleotides to a nucleic acid sequence, 
or the addition of one or more amino acids to a protein sequence. 

Isolated: An "isolated" biological component (such as a nucleic acid molecule, 
protein, or organelle) has been substantially separated or purified away from other 
20 biological components in the cell of the organism in which the component naturally 
occurs, such as other chromosomal and extra-chromosomal DNA and RNA, proteins 
and organelles. Nucleic acids and proteins that have been "isolated" include nucleic 
acids and proteins purified by standard purification methods. The term also embraces 
nucleic acids and proteins prepared by recombinant expression in a host cell as well as 
25 chemically synthesized nucleic acids and proteins. 

Label: An agent capable of detection, for example by ELISA, 
spectrophotometry, flow cytometry, or microscopy. For example, a label can be 
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attached to a nucleic acid, thereby permitting detection of the nucleic acid. Examples of 
labels include, but are not limited to, radioactive isotopes, enzyme substrates, co-factors, 
ligands, chemiluminescent agents, fluorophores, haptens, enzymes, and combinations 
thereof. Methods for labeling and guidance in the choice of labels appropriate for 
various purposes are discussed for example in Sambrook et al. (Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor, New York, 1989) and Ausubel et al. (In 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 

Methylenetetrahydrofolate reductase (MTHFR): A protein that participates 
in the remethylation pathway of intracellular homocysteine metabolism. In the 
remethylation pathway catalyzed by methionine synthase, cobalamin acts as a cofactor 
and the methyl group is donated by 5-methyI-tetrahydrofolate, which derives from the 
reduction of 5, lO-methylenetetrahydrofolate by MTHFR. Includes the product of any 
MTHFR gene, cDNA, or RNA, or an MTHFR protein from any organism, such as a 
human. Examples include the mRNA sequence disclosed in GenBank Accession No. 
NM_005957 (as well as the corresponding genomic and protein sequence). 

The human MTHFR gene is located on chromosome lp36.3, includes -17 kb of 
DNA and has 1 1 exons. At least two polymorphisms in human MTHRF are associated 
with venous thrombosis: a 677 C^T polymorphism and a 1298 A->C polymorphism. 

Mutation: Any change of a nucleic acid sequence as a source of genetic 
variation. For example, mutations can occur within a gene or chromosome, including 
specific changes in non-coding regions of a chromosome, for instance changes in or 
near regulatory regions of genes. Types of mutations include, but are not limited to, 
base substitution point mutations (such as transitions or transversions), deletions, and 
insertions. Missense mutations are those that introduce a different amino acid into the 
sequence of the encoded protein; nonsense mutations are those that infroduce a new stop 
codon; and silent mutations are those that introduce the same amino acid often with a 
base change in the third position of codon. In the case of insertions or deletions, 
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mutations can be in-frame (not changing the frame of the overall sequence) or frame 
shift mutations, which may result in the misreading of a large mmiber of codons (and 
often leads to abnormal termination of the encoded product due to the presence of a stop 
codon in the alternative frame). 
5 Nucleic acid array: An arrangement of nucleic acids (such as DNA or RNA) in 

assigned locations on a matrix, such as that found in cDNA arrays, or oligonucleotide 
arrays. 

Nucleic acid molecules representing genes: Any nucleic acid, for example 
DNA (mtron or exon or both), cDNA or RNA, of any length suitable for use as a probe 
10 or other indicator molecule, and that is informative about the corresponding gene. 

Nucleic acid molecules: A deoxyribonucleotide or ribonucleotide polymer 
including, without limitation, cDNA, mRNA, genomic DNA, and synthetic (such as 
chemically synthesized) DNA. The nucleic acid molecule can be double-stranded or 
single-stranded. Where single-stranded, the nucleic acid molecule can be the sense 
1 5 strand or the antisense strand. In addition, nucleic acid molecule can be circular or 
linear. 

The disclosure includes isolated nucleic acid molecules that include specified 
lengths of a VT-related nucleotide sequence. Such molecules can include at least 10, 15, 
20, 25, 30, 35, 40, 45 or 50 consecutive nucleotides of these sequences or more, and can 
20 be obtained from any region of a VT-related nucleic acid molecule. 

Nucleotide: Includes, but is not limited to, a monomer that includes a base linked 
to a sugar, such as a pyrimidine, purine or synthetic analogs thereof, or a base linked to an 
amino acid, as in a peptide nucleic acid (PNA). A nucleotide is one monomer in a 
polynucleotide. A nucleotide sequence refers to the sequence of bases in a 
25 polynucleotide. 

Oligonucleotide: An oligonucleotide is a plurality of joined nucleotides joined by 
native phosphodiester bonds, between about 6 and about 300 nucleotides in length. An 
oligonucleotide analog refers to moieties that fimction similarly to oligonucleotides but 
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have non-naturally occurring portions. For example, oligonucleotide aneilogs can contain 
non-naturally occurring portions, such as altered sugar moieties or inter-sugar linkages, 
such as a phosphorothioate oligodeoxynucleotide. 

Particular oligonucleotides and oligonucleotide analogs can include linear 
5 sequences up to about 200 nucleotides in length, for example a sequence (such as DNA or 
RNA) that is at least 6 bases, for example at least 8, 10, 15, 20, 21, 25, 30, 35, 40, 45, 50, 
100 or even 200 bases long, or from about 6 to about 50 bases, for example about 10-25 
bases, such as 12, 15 or 20 bases. 

Oligonucleotide probe: A short sequence of nucleotides, such as at least 8, 10, 

10 15, 20, 21, 25, or 30 nucleotides in length, used to detect the presence of a 
complementary sequence by molecular hybridization. In particular examples, 
oligonucleotide probes include a label that permits detection of oligonucleotide 
probe :target sequence hybridization complexes. 

Operably linked: A first nucleic acid sequence is operably linked with a 

1 5 second nucleic acid sequence when the first nucleic acid sequence is placed in a 

fimctional relationship with the second nucleic acid sequence. For instance, a promoter 
is operably linked to a coding sequence if the promoter affects the transcription or 
expression of the coding sequence. Generally, operably linked DNA sequences are 
contiguous and, where necessary to join two protein-coding regions, in the same reading 

20 fi-ame. 

Open reading frame (ORF): A series of nucleotide triplets (codons) coding for 
amino acids without any internal termination codons. These sequences are usually 
translatable into a peptide. 

Polymorphism: As a result of mutations, a gene sequence may differ among 
25 individuals. The differing sequences are referred to as alleles. The alleles that are 
present at a given locus (a gene's location on a chromosome is termed as a locus) are 
referred to as the individual's genotype. Some loci vary considerably among 
individuals. If a locus has two or more alleles whose fi-equencies each exceed 1% in a 
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population, the locus is said to be polymorphic. The polymorphic site is termed a 
polymorphism. The term polymorphism also encompasses variations that produce gene 
products with altered function, that is, variants in the gene sequence that lead to gene 
products that are not functionally equivalent. This term also encompasses variations 
that produce no gene product, an inactive gene product, or increased or decreased 
activity gene product or even no biological effect. 

Polymorphisms can be referred to, for instance, by the nucleotide position at 
which the variation exists, by the change in amino acid sequence caused by the 
nucleotide variation, or by a change in some other characteristic of the nucleic acid 
molecule or protein that is linked to the variation. 

Primers: Short nucleic acid molecules, for instance DNA oligonucleotides 10 - 
100 nucleotides in length, such as about 15, 20, 25, 30 or 50 nucleotides or more in 
length. Primers can be annealed to a complementary target DNA strand by nucleic acid 
hybridization to form a hybrid between the primer and the target DNA strand. Primer 
pairs can be used for amplification of a nucleic acid sequence, such as by PGR or other 
nucleic acid amplification methods known in the art. 

Methods for preparing and using nucleic acid primers are described, for example, 
in Sambrook et al. (In Molecular Cloning: A Laboratory Manual, CSHL, New York, 
1989), Ausubel al. (ed.) (In Current Protocols in Molecular Biology, John Wiley & 
Sons, New York, 1998), and Innis et al. {PGR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc., San Diego, CA, 1990). PGR primer pairs can be 
derived from a known sequence, for example, by using computer programs intended for 
that purpose such as Primer (Version 0.5, © 1991, Whitehead Institute for Biomedical 
Research, Cambridge, MA). One of ordinary skill in the art will appreciate that the 
specificity of a particular primer increases with its length. Thus, for example, a primer 
including 30 consecutive nucleotides of a VT-related protein encoding nucleotide will 
anneal to a target sequence, such as another homolog of the designated VT-related 
protein, with a higher specificity than a corresponding primer of only 15 nucleotides. 
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Thus, in order to obtain greater specificity, primers can be selected that includes at least 
20, 25, 30, 35, 40, 45, 50 or more consecutive nucleotides of a VT-related protein- 
encoding nucleotide sequences. 

Protein C (PC): PC is activated after the binding of thrombin to its endothelial 
5 receptor, thrombomodulin. Activated PC inhibits clot formation by cleaving and 
inactivating factors Va and Villa. Includes the product of any PC gene, cDNA, or 
RNA, or a PC protein from any organism, such as a human. Examples include the 
mRNA sequence disclosed in GenBank Accession No. BC034377. 1 (as well as the 
corresponding genomic and protein sequence). 
10 The human PC gene is localized to human chromosome 2ql3-14; it spans 

approximately 10 kb and contains nine exons. Loss-of-function mutations in the PC 
gene result in deficiency of PC, which is a well-established cause of VT. PC 
deficiency is classified into type I (low plasma concentrations of both functional and 
immunologic PC) and type II (low plasma levels of fimctional protein with normal 
15 antigen levels). 

Among the at least 161 different PC gene mutations related to venous 
thrombosis in humans, at least 5 1 distinct mutations (48 point mutations, 2 deletions 
and 1 insertion) have been described in multiple unrelated kindreds and the 
remaining mutations are unique to single families which makes them individual 
20 mutations. At least forty recurrent mutations are associated with type I PC 

deficiency and 1 1 mutations have been found in patients with type II PC deficiency. 

Two polymorphic sites (nt -1654C/T and -1641A/G) located in the 5' 
untranslated region of the PC gene also have an effect on plasma PC levels. 
Subjects carrying the CO allele have lower plasma PC levels than subjects with the 
25 other genotypes and this allele is a risk fector for venous thrombosis. 

Exemplary recurrent PC gene mutations and polymorphisms related to 
venous thrombosis are shown in Table 1. 
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Protein S (PS): A non-enzymatic cofactor for activated PC in the proteolytic 
inactivation of factors Va and Villa. Includes the product of any PS gene, cDNA, or 
RNA, or a PS protein from any organism, such as a human. Examples include the 
mRNA sequence disclosed in GenBank Accession No. NM_000313.1 (as well as the 
5 corresponding genomic and protein sequence). 

Human DNA contains two PS genes: the active PROSl gene and the 
pseudogene PRS02, which map to 3pl 1.1-ql 1.2. PRSOl spans 80 kb genomic 
DNA and includes 15 exons and 14 introns. Loss-of-function mutations in PRSOl 
lead to a deficiency of PS. Three types of PS deficiency are recognized based on 
1 0 plasma measurements: type I is characterized by low total and free PS antigen 

levels, type II by decreased activity and normal total and free PS antigen levels and 
type III by a selective reduction in free PS levels. 

Among at least 131 different PS gene mutations related to venous thrombosis in 
humans, at least 32 distinct mutations (25 point mutations, 3 deletions, 3 insertions, and 
1 5 1 deletion and insertion) have been described in multiple unrelated kindreds and the 
remaining mutations have been unique to single families which makes them individual 
mutations. Twenty-five recurrent mutations have been reported to be associated with 
quantitative (type I and/or type III) PS deficiency, 3 mutations with qualitative (type II ) 
PS deficiency and type of the PS deficiency could not be determined in the remaining 4 
20 mutations, either because one of the plasma assays was missing or because the subject 
was on oral anticoagulant therapy. Eight recurrent polymorphisms in the PS gene 
cosegregate with the deficient phenotype in families with hereditary PS deficiency. 

Exemplary recurrent PS gene mutations and polymorphisms related to 
venous thrombosis are shown in Table 1. 
25 Prothrombin (Factor II, FII): The precursor of serine protease thrombin, 

which is a vitamin K-dependent glycoprotein. Activated by FXa (in the presence of 
FVa and phospholipids), Flla exhibits procoagulant, anticoagulant, and antifibrinolytic 
activities. Includes the product of any FII gene, cDNA, or RNA, or a FII protein from 
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any organism, such as a human. Examples include the mRNA sequence disclosed in 
GenBank Accession No. V00595.1 (as well as the corresponding genomic and protein 
sequence). 

The human gene coding for FII is localized on chromosome 11, band 1 Ipl l-ql2 
5 and spans 21 kb of DNA. The FII gene is organized in 14 exons, separated by 13 
introns, with 5' and 3 '-untranslated (UT) regions. 

At least one single nucleotide substitution in the human FII gene is associated 
with increased thrombosis risk: to A polymorphism at nucleotide 20210. 

Purified: The term "purified" does not require absolute purity; rather, it is 
1 0 intended as a relative term. Thus, for example, a purified protein preparation is one in 
which the protein referred to is more pure than the protein in its natural environment 
within a cell. For example, a preparation of a protein is purified such that the protein 
represents at least 50% of the total protein content of the preparation. Similarly, a 
purified oligonucleotide preparation is one in which the oligonucleotide is more pure 
1 5 than in an environment including a complex mixture of oligonucleotides. 

Recombinant: A recombinant nucleic acid molecule is one that has a sequence 
that is not naturally occurring or has a sequence that is made by an artificial 
combination of two otherwise separated segments of sequence. This artificial 
combination can be accomplished by chemical synthesis or by the artificial 
20 manipulation of isolated segments of nucleic acid molecules, siich as by genetic 
engineering techniques. 

Sample: A specimen containing genomic DNA, RNA (including mRNA), 
protein, or combinations thereof, obtained fi-om a subject. Examples include, but are 
not limited to, peripheral blood, urine, saliva, tissue biopsy, surgical specimen, 
25 amniocentesis samples and autopsy material. 

Sequence identity/similarity: The identity/similarity between two or more 
nucleic acid sequences, or two or more amino acid sequences, is expressed in terms of the 
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identity or similarity between the sequences. Sequence identity can be measured in terms 
of percentage identity; the higher the percentage, the more identical the sequences are. 
Sequence similarity can be measured in terms of percentage similarity (which takes into 
account conservative amino acid substitutions); the higher the percentage, the more 
similar the sequences are. Homologs or orthologs of nucleic acid or amino acid 
sequences possess a relatively high degree of sequence identity/similarity when aligned 
using standard methods. This homology is more significant when the orthologous 
proteins or cDNAs are derived from species which are more closely related (such as 
human and mouse sequences), compared to species more distantly related (such as human 
and C. elegans sequences). 

Methods of alignment of sequences for comparison are well known in the art. 
Various programs and alignment algorithms are described in: Smith & Waterman, Adv. 
Appl. Math. 2:482, 1981; Needleman & Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & 
Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 
1988; Higgins & Sharp, CABIOS 5:151-3, 1989; Corpet et al.,Nuc. Acids Res. 16:10881- 
90, 1988; Huang etal Computer Appls. in the Biosciences 8, 155-65, 1992; and Pearson 
etal.,Meth. Mol. Bio. 24:307-31, 1994. Altschul a/., J. Mol. Biol. 215:403-10, 1990, 
presents a detailed consideration of sequence alignment methods and homology 
calculations. 

The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al, J. Mol. 
Biol. 215:403-10, 1990) is available from several sources, including the National Center 
for Biological Information (NCBI, National Library of Medicine, Buildmg 38 A, Room 
8N805, Bethesda, MD 20894) and on the Intemet, for use in connection with the sequence 
analysis programs blastp, blastn, blastx, tblastn and tblastx. Additional information can 
be found at the NCBI web site. 

BLASTN is used to compare nucleic acid sequences, while BLASTP is used to 
compare amino acid sequences. To compare two nucleic acid sequences, the options 
can be set as follows: -i is set to a file containmg the first nucleic acid sequence to be 
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compared (such as C:\seql.txt); -j is set to a file containing the second nucleic acid 
sequence to be compared (such as C:\seq2.txt); -p is set to blastn; -o is set to any desired ^ 
file name (such as C:\output.txt); -q is set to -1 ; -r is set to 2; and all other options are 
left at their default setting. For example, the following command can be used to 
generate an output file containing a comparison between two sequences: C:\B12seq -i 
c:\seql .txt -j c:\seq2.txt -p blastn -o c:\ou^ut.txt -q -1 -r 2. 

To compare two amino acid sequences, the options of B12seq can be set as 
follows: -i is set to a file containing the first amino acid sequence to be compared (such 
as C:\seql.txt); -j is set to a file containing the second amino acid sequence to be 
compared (such as C:\seq2.txt); -p is set to blastp; -o is set to any desired file name 
(such as C:\output.txt); and all other options are left at their default setting. For 
example, the following command can be used to generate an output file containing a 
comparison between two amino acid sequences: C:\B12seq -i c:\seql .txt -j c:\seq2.txt - 
p blastp -o c:\oulput.txt. If the two compared sequences share homology, then the 
designated output file will present those regions of homology as aligned sequences. If 
the two compared sequences do not share homology, then the designated output file will 
not present aligned sequences. 

Once aligned, the number of matches is determined by counting the number of 
positions where an identical nucleotide or amino acid residue is presented in both 
sequences. The percent sequence identity is determined by dividing the number of 
matches either by the length of the sequence set forth in the identified sequence, or by 
an articulated length (such as 100 consecutive nucleotides or amino acid residues fi-om a 
sequence set forth in an identified sequence), followed by multiplying the resulting 
value by 100. For example, a nucleic acid sequence tiiat has 1 166 matches when 
aligned with a test sequence having 1 154 nucleotides is 75.0 percent identical to the test 
sequence (i.e., 1 166-1554*100=75.0). The percent sequence identity value is rounded 
to the nearest tenth. For example, 75.11, 75.12, 75.13, and 75.14 are rounded down to 
75.1, while 75.15, 75.16, 75.17, 75.18, and 75.19 are rounded up to 75.2. The length 
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value will always be an integer. In another example, a target sequence containing a 20- 
nucleotide region that aligns with 20 consecutive nucleotides from an identified 
sequence as follows contains a region that shares 75 percent sequence identity to that 
identified sequence (that is, 15^20*100=75). 
5 1 20 

Target Sequence: AGGTCGTGTACTGTCAGTCA 

I II III MM MM I 

Identified Sequence : ACGTGGTGAACTGCCAGTGA 

For comparisons of amino acid sequences of greater than about 30 amino acids, 

1 0 the Blast 2 sequences function is employed using the default BLOSUM62 matrix set to 
default parameters, (gap existence cost of 11 , and a per residue gap cost of 1). Homologs 
are typically characterized by possession of at least 70% sequence identity counted over 
the full-length alignment with an amino acid sequence using the NCBI Basic Blast 2.0, 
gapped blastp with databases such as the nr or swissprot database. Queries searched with 

1 5 the blastn program are filtered with DUST (Hancock and Armstrong, 1 994, Comput. 
Appl. Biosci. 10:67-70). Other programs use SEG. In addition, a manual alignment can 
be performed. Proteins with even greater similarity will show increasing percentage 
identities when assessed by this method, such as at least about 75%, 80%, 85%, 90%, 
95%, 98%, or 99% sequence identity. 

20 When aligning short peptides (fewer than around 30 amino acids), the alignment is 

be performed using the Blast 2 sequences function, employing the PAM30 matrix set to 
default parameters (open gap 9, extension gap 1 penalties). Proteins with even greater 
similarity to the reference sequence will show increasing percentage identities when 
assessed by this method, such as at least about 60%, 70%, 75%, 80%, 85%, 90%, 95%, 

25 98%, 99% sequence identity. When less than the entire sequence is being compared for 
sequence identity, homologs will typically possess at least 75% sequence identity over 
short windows of 10-20 amino acids, and can possess sequence identities of at least 85%, 
90%, 95% or 98% depending on their identity to the reference sequence. Methods for 
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deteraiining sequence identity over such short windows are described at the NCBI web 
site. 

One indication that two nucleic acid molecules are closely related is that the two 
molecules hybridize to each other under stringent conditions, aS described above. Nucleic 
acid sequences that do not show a high degree of identity may nevertheless encode 
identical or similar (conserved) amino acid sequences, due to the degeneracy of the 
genetic code. Changes in a nucleic acid sequence can be made using this degeneracy to 
produce multiple nucleic acid molecules that all encode substantially the same protein. 
Such homologous nucleic acid sequences can, for example, possess at least about 60%, 
70%, 80%, 90%, 95%, 98%, or 99% sequence identity determined by this method. An 
alternative (and not necessarily cumulative) indication that two nucleic acid sequences are 
substantially identical is that the polypeptide which the first nucleic acid encodes is 
immunologically cross reactive with the polypeptide encoded by the second nucleic acid. 

One of skill in the art will appreciate that the particular sequence identity ranges 
are provided for guidance only; it is possible that strongly significant homologs could be 
obtained that fall outside the ranges provided. 

Single nucleotide polymorphism (SNP): A single base (nucleotide) difference in 
a DNA sequence among individuals in a population. SNPs can be causative (actually 
involved in or influencing the condition or trait to which the SNP is linked) or associative 
(linked to but not having any direct involvement in or influence on the condition or trait to 
which the SNP is linked). 

Subject: Living multi-cellular vertebrate organisms, a category that includes 
human and non-human mammals. 

Target sequence: A sequence of nucleotides located in a particular region in 
the human genome that corresponds to one or more specific genetic abnormalities, such 
as one or more nucleotide substitutions, deletions, insertions, amplifications, or 
combinations thereof The target can be for instance a coding sequence; it can also be 
the non-codmg strand that corresponds to a coding sequence. Examples of target 
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sequences include those sequences associated with venous thrombosis, such as those 
listed in Table 1 . 

Venous thrombosis (VT): A blood clot that forms within a vein. In particular 
examples, VT is associated with sluggish blood flow (for example as occvirs during 
5 prolonged bedrest, pregnancy, and surgery) or with rapid coagulation of the blood. 
Examples include deep venous thromboses (DVTs) that form in the deep veins of the 
legs or in the pelvic veins. Such thrombi sometimes migrate to the lungs and form 
pulmonary emboli that lead to cardiopulmonary collapse and death. 

Venous thrombosis (VT)-related (or associated) molecule: A molecule that is 
10 involved in the development of venous thrombosis. Such molecules include, for 
instance, nucleic acids (such as DNA, cDNA, or mRNAs) and proteins. Specific 
examples of VT-related molecules include those listed in Table 1, as well as fragments 
of the full-length genes or cDNAs that include the mutation(s), polymorphism(s), or 
both, responsible for increasing an individual's susceptibility to VT, and proteins and 
1 5 protein fragments encoded thereby. 

VT-related molecules can be involved in or influenced by venous thrombosis in 
many different ways, including causative (in that a change in a VT-related molecule 
leads to development of or progression to venous thrombosis) or resultive (in that 
development of or progression to venous thrombosis causes or results in a change in the 
20 VT-related molecule). 

Wild-type: A genotype that predominates in a natural population of organisms, 
in contrast to that of mutant forms. 

Mutations and Polymorphisms Involved in Venous Thrombosis 

25 Complex traits such as venous thrombosis can be understood by assuming an 

interaction between different mutations, polymorphisms, or both, in candidate 
susceptibility genes. The risk that is associated with each genetic defect may be 
relatively low in isolation but the simultaneous presence of several mutations 
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polymorphisms, or both, may dramatically increase disease susceptibility. Moreover, 
environmental factors can interact with one or more genetic variations to add further to 
the risk. Expression of a venous thrombosis phenotype is dependent on the interaction 
of gene products from several loci and environmental or acquired influences. Therefore, 
5 VT is a complex genetic disorder. 

Several mutations and polymorphisms, such as one or more nucleotide 
substitutions, insertions, and deletions in genes associated with a risk of developing VT 
are known. However, a combination of such mutations and polymorphisms that permit 
accurate prediction of a subject's genetic predisposition to VT, in multiple ethnic 
1 0 groups, has not been previously identified. 

Factor V, prothrombin, and fibrinogen 

Several genes involved in venous thrombosis include Factor V (FV), 

prothrombin (Factor II), and fibrinogen, are involved in procoagulant pathways. 
1 5 Altered activity of mutated F V is the most common hereditary blood coagulation 

disorder that affects development of VT (Nicolaes et al, Arterioscler. Thromb. Vase. 

Biol. 22:530-8, 2002). Downregulation of the procoagulant activity of activated FV 

(FVa) is accomplished by activated protein C (APC)-mediated proteolysis of FVa at 

three different s^uential cleavage sites: Arg 506, Arg 306, and Arg 679 (the 
20 numbering of nucleotides or amino acids herein refer to human genes). A defect at one 

or more of these three cleavage sites can affect the APC inactivation process even 

though procoagulation activity may remain normal. 

At least five recurrent single nucleotide substitutions in the human FV gene are 

associated with increased thrombosis risk. In 90% of cases, resistance to APC due to a 
25 single nucleotide substitution (F V Leiden; 1 69 1 G-> A) that results in the replacement of 

Arg506 with Ghi (R506Q). Its prevalence in Caucasian populations is approximately 

5% and is as high as 20% to 40% m patients with VT. However, very few cases of FV 
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Leiden have been reported among other races (Takamiya et al., Thromb. Haemost. 
74:996, 1995; Fujimura etal, Thromb. Haemost J A.UU-l, 1995; Chan etal, Thromb. 
Haemost. 75:522-3, 1996). 

Another single nucleotide substitution in the FV gene, R485K, is associated with 
increased thrombosis risk in Far East populations (Hiyoshi et al, Thromb. Haemost. 
80:705-6, 1998; Le etal, Clin. Genet. 57:296-303, 2000). The R485K polymorphism 
is a G ^ A transition occurring at nucleotide 1 628 and results in the replacement of the 
codon AGA of Arg 485 by an AAA codon predicting a Lys residue. Although the 
frequency of the K485 allele is low in Caucasians and high in Asians, this 
polymorphism is associated with increased thrombosis risk in both Far East and 
Caucasian populations (Dogulu a/., Thromb. Res. 111:389-95, 2003). 

Three other single nucleotide substitutions are associated with increased risk of 
thrombosis in different populations. Two mutations in exon 7 of the human FV gene 
affect the Arg306 APC cleavage site. These two mutations also have a heterogeneous 
racial distribution. The FV Cambridge mutation is a G to C transition at nucleotide 
position 1091 and predicts replacement of arginine with a threonine at amino acid 
position 306 (Arg306Thr). This mutation has only been described in Caucasian 
populations (Franco et al, Thromb. Haemost. 81:312-3, 1999). The second mutation, 
FV Hong Kong, is an A to G transition at nucleotide position 1090 and changes Arg306 
to Gly. Although this mutation was originally described in Chinese populations, it has a 
prevalence of 0.4% in Caucasians (Franco et al, Thromb. Haemost. 81 :3 12-3, 1999). 

Another single nucleotide substitution in exon 13 of the human FV gene, 
referred to as the R2 allele, is an A to G transition at nucleotide position 4070, which 
replaces His by Arg at position 1299 (H1299R). The prevalence of R2 allele is 
significantly higher in the patients with VT than in the healthy controls, with respective 
values of 18.5% and 11.4% (Alhenc-Gelasefa/., Thromb. Haemost. 81:193-7, 1999). 
This polymorphism has a prevalence of 1 1 .9% in U.S. Caucasians, 5.6% in Afiican- 
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Americans, 13.4% in Asian or Pacific Islanders and 1 1.3% in Hispanics (Benson et al, 
Thromb. Haemost. 86:1188-92, 2001). 

At least one single nucleotide substitution is associated with increased 
thrombosis risk in the prothrombin (Factor II, FII) gene. The G->- to A polymorphism 
at nucleotide 20210 in the 3'-UT region of the prothrombin gene is the second most 
fi-equently inherited risk factor for venous thrombosis (Poort et al. Blood 88:3698-703, 
1996). FII G20210A is associated with hyperprothrombinemia and a two- to five fold 
increased risk of VT and is found in l%-3% of subjects in the general population and in 
6%-18% of patients with VT. The allele frequency varies in Caucasian populations, 
ranging fi-om 1.2% to 4% (Rosendaal etal, Thromb. Haemost 79:706-8, 1998), but is 
quite rare in African Americans and Amerindians fi-om Brazil (Dilley et al. , Blood 
90:652a, 1997; Arrudaefa/., Thromb. Haemost. 78:1430-3, 1997). This mutation has 
not been found in Somalians, people of West African origin, Amazonian Indians, or 
Japanese subjects (Ferraresi a/., .<4r/er/ojc/er. Thromb. Vase. Biol. 17:2418-22, 1997; 
Rahimy etal, Thromb. Haemost. 79:444-5, 1998; Isshiki etal. Blood Coagul Fibrinol 
9:105-6, 1998; Miyata et al. Blood Coagul Fibrinol 9:451-2, 1998). 

At least 25 thrombophilic fibrinogen mutations (22 single nucleotide 
substitutions, 1 insertion and two deletions) are associated with VT (De Stefano et al, 
Br. J. Haematol 106:564-8, 1999). At least thirteen of the mutations are from different 
unrelated kindreds; the remaining mutations are unique to single families, making them 
individual mutations. The prevalence of inherited dysfibrinogenemia among the general 
population is unknown; however, the prevalence among patients with a history of 
venous thrombosis is 0.8 % (Carter et al. Blood 96:1 177-9, 2000). A conunon 
polymorphism leading to a substitution of threonine by alanine at codon 312 
(Thr312Ala polymorphism) within the carboxy-terminal end of the fibrinogen Aa chain 
is associated with venous thromboembolism via influencing clot stability and 
predisposing clots to embolization in the venous vascular trees (Carter et al. Blood 
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96:1 177-9, 2000; Standeven et al. Circulation 107:2326-30, 2003; Hayes, Arch. 
Pathol. Lab. Med. 126:1387-90, 2002). The polymorphism is observed in 51% of 
patients with pxxlmonary embolism and in 40% of healthy subjects. No differences have 
been found in genotype distribution for Thr312Ala polymorphism in Caucasians and 
Asians and this polymorphism is associated with elevated fibrinogen levels in both 
populations (Lmetal,! Med. Genet. 38:31-5, 2001; Kain et al.. Am. J. Epidemiol. 
156:174-9, 2002). 

Protein C, Protein S, and Antithrombin III 

Several other genes involved in venous thrombosis, including protein C (PC), 
protein S (PS), and antithrombin III, are involved in anticoagulant pathways. PC and 
PS deficiencies result in defects in the activated PC anticoagulant system. 

At least 161 different detrimental PC gene mutations have been reported in 
humans (Reitsma et al., Thromb. Haemost. 73:876-89, 1995). Among these 161 
different PC gene mutations, only 51 distinct mutations (48 point mutations, 2 deletions 
and 1 insertion) have been described in multiple unrelated kindreds and the remaining 
109 mutations have been unique to single families which makes them individual 
mutations. Forty recurrent mutations are associated with type I PC deficiency and 1 1 
mutations were observed in patients with type II PC deficiency. Three polymorphic 
sites (nt -1654C/T, -1641A/G and -1476A/T) are located in the 5' untranslated region of 
the gene; two (the nt -1654C/T and -1641 A/G transitions) have an effect on plasma PC 
levels. 

PS deficiency has a highly heterogeneous molecular basis with at least 131 
different mutations (Gandrille et al., Thromb. Haemost. 84:918, 2000). Among all PS 
gene detrimental mutations, only 32 distinct mutations (25 point mutations, 3 deletions, 
3 insertions, and 1 deletion and insertion) have been described in multiple unrelated 
kindreds and the remaining 100 mutations have been unique to single families which 
makes them individual mutations. Twenty-five recurrent mutations have been reported 
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to be associated with quantitative (type I and/or type III) PS deficiency, and 3 mutations 
with qualitative (type II) PS deficiency. Eight recurrent polymorphisms in the PS gene 
cosegregate with the deficient phenotype in families with hereditary PS deficiency. 

The prevalence of PC deficiency in the general population is approximately 
1/300. The carrier state for PC and PS deficiencies is associated with approximately a 
10-fold increased thrombosis risk for VT. Homozygous PC and PS deficiency is 
usually associated with a severe clinical phenotype known as purpura fulminans, 
characterized by extensive thromboses in the microcirculation early after birth. 

Heterozygous antithrombin III (AT III) deficiency is associated with increased 
risk for VT. There are at least 127 distinct defects (Lane et al, Thromb. Haemost. 
77: 197-21 1 , 1997) associated with AT III deficiency: 92 mutations for type I AT III 
deficiency (40 point mutations, 40 small insertions or deletions and 12 large deletions) 
and 35 mutations for type II AT III deficiency (12 RS, 12 HBS and 1 1 PE mutations, all 
point mutations). Among the type I mutations, only 1 1 distinct mutations (7 point 
mutations and 4 deletions or insertions) have been described in multiple unrelated 
kindreds and the remaining 81 mutations have been unique to single families which 
makes them individual mutations. In type II, 19 of the 35 mutations (seven RS, six 
HBS and six PE mutations) have been described in multiple unrelated kindreds and the 
remaining 16 have been reported to be individual mutations. 

In addition to causative AT III gene mutations in subjects with hereditary AT III 
deficiency, five distinct AT III gene polymorphisms cosegregate with the deficient 
phenotype in families with hereditary AT III deficiency. These polymorphisms are also 
fi-equent in the healthy population. The prevalence of AT III deficiency in the general 
population ranges from 0.2/1000 to 18/1000. In a population-based control study, a 
five-fold increased risk for VT linked AT III deficiency was reported. The prevalence of 
AT III deficiency in thrombosis patients ranges from 1% to 8%. 
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Angwtensin I-converting enzyme and methylenetetrahydrofolate reductase 

Additional genes involved in venous thrombosis include, but are not limited to, 
angiotensin I-converting enzyme (ACE) and methylenetetrahydrofolate reductase 
(MTHFR). 

The renin angiotensin system affects hemostasis through different mechanisms. 
In intron 16 of the human ACE gene, a polymorphism consisting of an insertion or 
deletion of a 288-bp fragment is known (Rigat et al, Nuc. Acids Res. 20:1433, 1992). 
The ACE DD genotype is associated with increased levels of circulating enzyme and 3 
to 10-fold increased risk to venous thromboembolism among Caucasians and African- 
Americans. The ACE DD genotype has also been reported in the Japanese population. 

Mild-to-moderate hyperhomocysteinemia (fasting levels of total homocysteine 
between 15 and 100 jimol/) is an established risk factor for VT and is associated with 
two- to four fold increased risk of thrombosis. Although it can be caused by several 
acquired causes including nutritional deficiencies of vitamin B12, vitamin B6 and 
folate, advanced age, chronic renal failure and the use of anti-folic drugs, two common 
polymorphisms in the methylenetetrahydrofolate reductase (MTHFR) gene are 
associated with mild-to moderate hyperhomocysteinemia (Cattaneo, Thromb. Haemost. 
81:165-76, 1999; Franco and Reitsma, Hum. Genet. 109:369-84, 2001). 

MTHFR 677 C^T polymorphism is located in human exon 4 at the folate 
binding site, converting an alanine into a valine. In its homozygous state, C677T 
polymorphism is associated with thermolability of MTHFR, leading to 60-70% 
reduction of the enzymatic activity and mild to moderate hyperhomocysteinemia 
(Franco and Reitsma, Hum. Genet. 109:369-84, 2001; Frosst et al, Nat. Genet. 10:1 1 1- 
3, 1995). The C677T polymorphism in human MTHFR has a relatively high frequency 
throughout the world, TT genotype is present in about 5% to 17% of the general 
population with a very heterogeneous distribution among different ethnic groups, 
highest prevalence in Europe and lowest prevalence in Africa (Frosst et al, Nat. Genet. 
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10:1 1 1-3, 1995; Schneider et al. Am. J. Hum. Genet. 62:1258-60, 1998; De Franchis et 
al.,Am. J. Hum. Genet. 59:262-4, 1996; Ma etal.. Circulation. 94:2410-6, 1996; 
Deloughery et al.. Circulation 94:3074-8, 1996; Arruda V et al., Thromb. Haemost. 
77:818-21, 1997). Homozygous MTHRF C677T polymorphism is an independent risk 
factor for venous thrombosis with a prevalence of 1 1%-27% in venous thrombosis 
patients in Caucasians but not associated v^dth VT in Asians and Africans (Arruda et al, 
Thromb. Haemost. 77:818-21, 1997; Margaglione etal, Thromb. Haemost. 79:907-11, 
1998; ^d\om6netal,Arterioscler. Thromb. Vase. Biol 19:511-8, 1999). 

Another MTHFR polymorphism, 1298 A->C, is in human exon 7 within the 
presumptive regulatory domain and converts a glutamine into an alanine. By itself, this 
polymorphism does not appear to be associated with hyperhomocysteinemia but 
compound heterozygosity with MTHFR 677 C^T results in decreased enzyme activity 
and increased homocysteine levels (Weisberg etal, Mol Genet Metab. 68:51 1-2, 
1999). 

Determining Genetic Predisposition to Venous Thrombosis 

Provided herein are methods of determining whether a subject, such as an 
otherwise healthy subject, or a subject suspected or at risk of developing thrombi, is 
susceptible to developing venous thrombosis (VT). The methods mvolve detecting an 
abnormality (such as a mutation, polymorphism, or both, herein 
mutation/polymorphism) in at least one VT-related molecule of the subject, such as a 
nucleic acid that encodes a coagulation-related protein. Specific encompassed 
embodiments include diagnostic or prognostic methods in which one or more mutations, 
polymorphisms, or both, in a VT-related nucleic acid molecule in cells of the individual 
is detected. In particular embodiments, an abnormality is detected in a subset of VT- 
related molecules (such as nucleic acid sequence) that selectively detect a predisposition 
of a subject to develop VT. In particular examples, the subset of molecules includes a 
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set of 10 VT-related susceptibility alleles associated with venous thrombotic events, 
wherein the 10 VT-related susceptibility alleles are present in at least 95% of 
Caucasians subjects who are at risk for (or who have experienced) a venous thrombosis. 
In particular examples, the 10 VT-related susceptibility alleles are present in at least 
98% of Caucasians, such as at least 99%, and at least 85% of Asians and African 
populations, such as at least 88%, who have or are at risk of developing a VT. 

In yet other examples, the number of VT-related susceptibility alleles screened is 
at least 10, for example at least 15, at least 20, at least 50, at least 100, at least 15 1, at 
least 200, at least 302, or even at least 500 alleles. In other examples, the methods 
employ screening no more than 600, no more than 500, no more than 400, no more than 
302, no more than 200, no more than 151, no more than 100, no more than 50, or no 
more than 10 VT-related susceptibility alleles. Examples of particular VT-related 
susceptibility alleles are shown in Table 1. 

As used herein, the term "VT-related molecule" includes VT-related nucleic acid 
molecules (such as DNA, RNA or cDNA) and VT-related proteins. The term is not 
limited to those molecules listed in Table 1 (and molecules that correspond to those 
listed), but also includes other nucleic acid molecules and proteins that are influenced 
(such as to level, activity, localization) by or during venous thrombosis, including all of 
such molecules listed herein. Examples of VT-related genes include factor V, 
prothrombin (factor II), fibrinogen, protein C, protein S, antithrombm III, angiotensin-I 
converting enzyme (ACE), and methylenetetrahydroflate reductase (MTHFR). In 
certain examples, abnormalities are detected in at least one VT-related nucleic acid, for 
instance in at least 2, 3, 4, 5, 6, 7, 8, 10, 15 or more VT-related nucleic acid molecules. 
In particular examples, certain of the described methods employ screening no more than 
100, no more than 50, no more than 40, no more than 30, no more than 20, or no more 
than 15 VT-related genes. 

This disclosed method (MERT) provides a rapid, straightforward, accurate and 
affordable multiple genetic screening method performed in one assay for inherited 
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venous thrombosis susceptibility with a high predictive power for identification of 
asymptomatic carriers. It allows early recognition of subjects who may require 
prophylactic anticoagulant therapy during high risk situations, such as pregnancy, 
puerperium, use of oral contraceptives or hormone replacement therapy, trauma, 
surgery, fractures, prolonged immobilization, long air joumeys (such as those more than 
4 hours), advanced age, antiphospholipid antibodies, previous thrombosis history, 
myeloproliferative disorders, malignancy, or combinations thereof. The disclosed assay 
can be used to reduce the yearly incidence of venous thrombosis by early identification 
of individuals at inherited risk. By detecting individuals before they develop symptoms, 
effective preventive measures, such as early thromboprophylaxis or even decisions such 
as avoiding the use of oral contraceptives or hormone replacement therapy, can be 
instituted. 

As discussed above, there are differences in the causes of the inherited venous 
thrombosis among different ethnic groups. Whereas FV Leiden and prothrombin 
G0210A polymorphisms are the most prevalent risk factors for venous thrombosis in 
Caucasians, Asian and African patients exhibit no or very rare FV Leiden or 
prothrombin G20210A polymorphisms. The disclosed methods and arrays are designed 
to determine inherited venous thrombophilia risk not only in Caucasians but also 
diverse ethnic populations. In one particular example, the method has a high predictive 
power in different ethnic populations (such as at least 98% for Caucasians, at least 88% 
for Asians and at least 90% for Africans). In other examples, the method detects 
abnormalities in VT-related molecules (such as nucleic acid sequences) wherein the 
abnormalities are found in at least 99% of Caucasians, at least 88% of Asians, and at 
least 91% of Africans who have had a VT. Therefore, the applicability of the disclosed 
methods and arrays in diverse ethnic populations makes it a powerfid approach. 

In particular examples, the disclosed methods and arrays are cost-effective 
compared to the currently available plasma-based thrombophilia screening panel, which 
includes antigenic and activity based determination of protein C and S, AT III antigen 
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and activity, thrombin time and reptilase time for dysfibrinogenemia, quantitative 
determination of fibrinogen level and PCR-based direct mutation analysis of FV Leiden, 
prothrombin 2021 OA and MTHFR polymorphisms. 

For example, the disclosed method provides advantages over AT III deficiency 
assays, because the disclosed method detects type I and II deficiencies in one assay 
instead of two subsequent tests, and detects variant AT III type II defects that can 
unfortunately be missed due to long incubation times of many automated functional 
heparin cofactor assay analyzers used in clinical practice, and by avoiding high rates of 
false positivity. 

In other examples, the disclosed method provides advantages over PC deficiency 
assays, including the clotting assay of functional PC level, immunological PC assay and 
chromogenic assay of PC activity, because the disclosed method can in one embodiment 
detect type I and II deficiencies in one assay instead of three subsequent tests and 
overcomes the difficulty of distinguishing healthy subjects fi-om asymptomatic PC 
deficient individuals due to the presence of a significant overlap between low normal 
levels and mild PC deficiency, by avoiding the underdetermination of PC deficient 
individuals because of the increase in PC concentration as a function of age which is 
approximately 4% per decade, and by avoiding high rates of false positivity. 

In other examples, the disclosed method provides advantages over PS deficiency 
assays, which include clotting assay of functional PS level, immunoassay of PS, and 
enzyme-linked immunosorbent assays for total and fi-ee PS measurements because the 
disclosed method can in one embodiment detect quantitative and functional defects in 
one assay instead of four subsequent phenotypic assays and overcomes the difficulty in 
the diagnosis of PS deficiency with the immunologic assay which is complicated by the 
presence of two molecular forms of PS in the plasma (free PS and C4b-BP/PS 
complexes), overcomes the difficulty of distinguishing healthy subjects from 
asymptomatic PS deficient mdividuals due to the presence of overlapping values 
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between controls and PS deficient individuals, especially those with type III PS 
deficiency, and by avoiding high rates of false positivity. 

In other examples, the disclosed method provides an advantage over assays for 
dysfibrinogenemia which include thrombin time and reptilase time as first line tests, 
5 because the disclosed method avoids high rates of false positivity. 

Clinical Specimens 

Appropriate specimens for use with the current disclosure in determining a 
subject's genetic predisposition to VT include any conventional clinical samples, for 

1 0 instance blood or blood-fi-actions (such as serum). Techniques for acquisition of such 
samples are well known in the art (for example see Schluger eial. J. Exp. Med. 
176:1327-33, 1992, for the collection of serum samples). Serum or other blood 
firactions can be prepared in the conventional manner. For example, about 200 ^iL of 
serum can be used for the extraction of DNA for use in amplification reactions. 

1 5 Once a sample has been obtained, the sample can be used directly, concentrated 

(for example by centrifiigation or filtration), purified, or combinations thereof, and an 
amplification reaction performed. For example, rapid DNA preparation can be 
performed using a commercially available kit (such as the InstaGene Matrix, BioRad, 
Hercules, CA; the NucliSens isolation kit, Organon Teknika, Netherlands). In one 

20 example, the D^A preparation method yields a nucleotide preparation tiiat is accessible 
to, and amenable to, nucleic acid amplification. 

Amplification of nucleic acid molecules 

The nucleic acid samples obtained fi-om the subject to obtain amplification 
25 products, including sequences fi-om factor V, protiirombin (factor II), fibrinogen, protein 
C, protein S, AT III, ACE, and MTHFR, can be amplified fi-om the clinical sample prior 
to detection. In one example, DNA sequences are amplified. In another example, RNA 
sequences are amplified. 
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Any nucleic acid amplification method can be used. In one specific, non- 
limiting example, polymerase chain reaction (PGR) is used to amplify the nucleic acid 
sequences associated with venous thrombosis. Other exemplary methods include, but 
are not limited to, RT-PCR and transcription-mediated amplification (TMA). 

The target sequences to be ampUfied fi-om the subject include factor V, factor II, 
fibrinogen, protein C, protein S, AT III, ACE, and MTHFR. In particular examples, the 
VT-associated target sequences to be amplified consist essentially of, or consist only of 
factor V, factor II, fibrinogen, protein C, protein S, AT III, ACE, and MTHFR. 

A pair of primers can be utilized in the amplification reaction. One or both of 
the primers can be labeled, for example with a detectable radiolabel, fluorophore, or 
biotin molecule. The pair of primers includes an upstream primer (which binds 5' to the 
downstream primer) and a downstream primer (which binds 3' to the upstream primer). 
The pair of primers used in the amplification reaction are selective primers which permit 
amplification of a nucleic acid involved in venous thrombosis. Primers can be selected 
to amplify a nucleic acid molecule listed in Table 1, or represented by those listed in 
Table 1. 

An additional pair of primers can be included in the amplification reaction as an 
internal control. For example, these primers can be used to amplify a "housekeeping" 
nucleic acid molecule, and serve to provide confirmation of appropriate amplification. 
In another example, a target nucleic acid molecule including primer hybridization sites 
can be constructed and included in the amplification reactor. One of skill in the art will 
readily be able to identify primer pairs to serve as internal control primers 

Arrays for detecting nucleic acid and protein sequences 

In particular examples, methods for detecting an abnormality in at least one VT- 
related gene use the arrays disclosed herein. Such arrays can include nucleic acid 
molecules. In one example, the array includes nucleic acid oligonucleotide probes that 
can hybridize to wild-type or mutant/polymorphic VT gene sequences, such as factor 
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V, prothrombin (factor II), fibrinogen, protein C, protein S, AT III, ACE, and MTHFR. 
In a particular example, an array includes oligonucleotides that can recognize all 151 
VT-associated recurrent mutations and polymorphisms listed in Table 1, such as the 
oligonucleotide probes shown in odd numbered SEQ ID NOS: 1-301. In other 
5 examples, an array includes oligonucleotide probes that can recognize both mutant and 
wild-type factor V, prothrombin (factor II), fibrinogen, PC, PS, AT III, ACE, and 
MTHFR sequences, such as SEQ ID NOS: 1-302. Certain of such arrays (as well as 
the methods described herein) can include VT-related molecules that are not listed in 
Table 1. 

10 Arrays can be used to detect the presence of amplified sequences involved in 

venous thrombosis, such as factor V, prothrombin (factor II), fibrinogen, protein C, 
protein S, antithrombin III, ACE, and MTHFR sequences, using specific 
oligonucleotide probes. The arrays herein termed "VT detection arrays," are used to 
determine the genetic susceptibility of a subject to developing venous thrombosis. In 

1 5 one example, a set of oligonucleotide probes is attached to the surface of a solid 

support for use in detection of the VT-associated sequences, such as those amplified 
nucleic acid sequences obtained from the subject. Additionally, if an internal control 
nucleic acid sequence was amplified in the amplification reaction (see above), an 
oligonucleotide probe can be included to detect the presence of this amplified nucleic 

20 acid molecule. 

The oligonucleotide probes bound to the array can specifically bind sequences 
amplified in the amplification reaction (such as under high stringency conditions). 
Thus, sequences of use with the method are oligonucleotide probes that recognize the 
VT-related sequences, such as factor V, prothrombin (factor II), fibrinogen, PC, PS, 

25 antithrombin III, ACE, and MTHFR gene sequences. Such sequences can be 

determined by examining the sequences of the different species, and choosing primers 
that specifically anneal to a particular wild-type or mutant sequence (such as those 
listed in Table 1 or represented by those listed m Table 1), but not others. One of skill 
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in the art will be able to identify other VT-associated oligonucleotide molecules that 
can be attached to the surface of a solid support for the detection of other amplified 
VT-associated nucleic acid sequences. 

The methods and apparatus in accordance with the present disclosure takes 
5 advantage of the fact that under appropriate conditions oUgonucleotides form base- 
paired duplexes with nucleic acid molecules that have a complementary base sequence. 
The stability of the duplex is dependent on a number of factors, including the length of 
the oligonucleotides, the base composition, and the composition of the solution in which 
hybridization is effected. The effects of base composition on duplex stability may be 
1 0 reduced by carrying out the hybridization in particular solutions, for example in the 
presence of high concentrations of tertiary or quaternary amines. 

The thermal stability of the duplex is also dependent on the degree of sequence 
similarity between the sequences. By carrying out the hybridization at temperatures 
close to the anticipated T„'s of the type of duplexes expected to be formed between the 
1 5 target sequences and the oligonucleotides bound to the array, the rate of formation of 
mis-matched duplexes may be substantially reduced. 

The length of each oligonucleotide sequence employed in the array can be 
selected to optimize binding of target VT-associated nucleic acid sequences. An 
optimum length for use with a particular VT-associated nucleic acid sequence under 
20 specific screening conditions can be determined empirically. Thus, the length for each 
individual element of the set of oligonucleotide sequences including in the array can be 
optimized for screening. In one example, oligonucleotide probes are from about 20 to 
about 35 nucleotides in length or about 25 to about 40 nucleotides in length. 

The oligonucleotide probe sequences forming the array can be directly linked to 
25 the support. Alternatively, the oligonucleotide probes can be attached to the support by 
non-VT-associated sequences such as oligonucleotides or other molecules that serve as 
spacers or linkers to the solid support. 
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In another example, an array includes protein sequences, which include at least 
one VT-related protein such as one encoded by a nucleic acid molecule listed in Table 1 
(or genes, cDNAs or other polynucleotide molecules including one of the listed 
sequences, or a fragment thereof), or a fragment of such protein, or an antibody specific 
to such a protein or protein fragment. Such arrays can also contain any particular subset 
of the nucleic acids (or corresponding molecules) listed in Table 1 . The proteins or 
antibodies forming the array can be directly linked to the support. Alternatively, the 
proteins or antibodies can be attached to the support by spacers or linkers to the solid 
support. 

Abnormalities in VT-related proteins can be detected using, for instance, a VT 
protein-specific binding agent, which m some instances will be detectably labeled. In 
certain examples, therefore, detecting an abnormality includes contacting a sample from 
the subject with a VT protein-specific binding agent; and detecting whether the binding 
agent is bound by the sample and thereby measuring the levels of the VT-related protein 
present in the sample, in which a difference in the level of VT-related protein in the 
sample, relative to the level of VT-related protein foimd an analogous sample from a 
subject not predisposed to developing VT, or a standard VT-related protein level in 
analogous samples from a subject not having a predisposition for developing VT, is an 
abnormality in that VT-related molecule. 

The solid support can be formed from an organic polymer. Suitable materials 
for the solid support include, but are not limited to: polypropylene, polyethylene, 
polybutylene, polyisobutylene, polybutadiene, polyisoprene, polyvinylpyrrolidine, 
polytetrafluroethylene, polyvinylidene difluroide, polyfluoroethylene-propylene, 
polyethylenevinyl alcohol, polymethylpentene, polycholorotrifluoroethylene, 
polysulfomes, hydroxylated biaxially oriented polypropylene, aminated biaxially 
oriented polypropylene, thiolated biaxially oriented polypropylene, etyleneacrylic acid, 
thylene methacryhc acid, and blends of copolymers thereof (see U.S. Patent No. 
5,985,567, herein incorporated by reference). 
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In general, suitable characteristics of the material that can be used to form the 
solid support surface include: being amenable to surface activation such that upon 
activation, the surface of the support is capable of covalently attaching a biomolecule 
such as an oligonucleotide thereto; amenability to "in situ" synthesis of biomolecules; 
5 being chemically inert such that at the areas on the support not occupied by the 
oligonucleotides are not anaenable to non-specific binding, or when non-specific 
binding occurs, such materials can be readily removed from the surface without 
removing the oligonucleotides. 

In one example, the solid support surface is polypropylene. Polypropylene is 

10 chemically inert and hydrophobic. Non-specific binding is generally avoidable, and 
detection sensitivity is improved. Polypropylene has good chemical resistance to a 
variety of organic acids (such as formic acid), organic agents (such as acetone or 
ethanol), bases (such as sodium hydroxide), salts (such as sodium chloride), oxidizing 
agents (such as peracetic acid), and mineral acids (such as hydrochloric acid). 

1 5 Polypropylene also provides a low fluorescence background, which minimizes 
background interference and increases the sensitivity of the signal of interest. 

In another example, a surface activated organic polymer is used as the solid 
support surface, One example of a surface activated organic polymer is a polypropylene 
material aminated via radio fi-equency plasma discharge. Such materials are easily 

20 utilized for the attachment of nucleotide molecules. The amine groups on the activated 
organic polymers are reactive with nucleotide molecules such that the nucleotide 
molecules can be bound to the polymers. Other reactive groups can also be used, such 
as carboxylated, hydroxylated, thiolated, or active ester groups. 

A wide variety of array formats can be employed in accordance with the present 

25 disclosure. One example includes a linear array of oligonucleotide bands, generally 
referred to in the art as a dipstick. Another suitable format includes a two-dimensional 
pattern of discrete cells (such as 4096 squares in a 64 by 64 array). As is appreciated by 
those skilled in the art, other array formats including, but not limited to slot 
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(rectangular) and circular arrays are equally suitable for use (see U.S. Patent No. 
5,981,185, herein incorporated by reference). In one example, the array is formed on a 
polymer medium, which is a thread, membrane or film. An example of an organic 
polymer medium is a polypropylene sheet having a thickness on the order of about 1 
5 mil. (0.001 inch) to about 20 mil., although the thickness of the film is not critical and 
can be varied over a fairly broad range. Particularly disclosed for preparation of arrays 
at this time are biaxially oriented polypropylene (BOPP) films; in addition to their 
durability, BOPP films exhibit a low background fluorescence. In a particular example, 
the array is a solid phase, Allele-Specific Oligonucleotides (ASO) based nucleic acid 
10 array. 

The array formats of the present disclosure can be included in a variety of 
different types of formats. A "format" includes any format to which the solid support 
can be affixed, such as microtiter plates, test tubes, inorganic sheets, dipsticks, and the 
like. For example, when the solid support is a polypropylene thread, one or more 

1 5 polypropylene threads can be affixed to a plastic dipstick-type device; polypropylene 
membranes can be affixed to glass slides. The particular format is, in and of itself, 
unimportant. All that is necessary is that the solid support can be affixed thereto 
without affecting the fimctional behavior of the solid support or any biopolymer 
absorbed thereon, and that the format (such as the dipstick or slide) is stable to any 

20 materials into which the device is introduced (such as clinical samples and hybridization 
solutions). 

The arrays of the present disclosure can be prepared by a variety of approaches. 
In one example, oligonucleotide or protein sequences are synthesized separately and 
then attached to a solid support (see U.S. Patent No. 6,013,789, herein incorporated by 
25 reference). In another example, sequences are synthesized directly onto the support to 
provide the desired array (see U.S. Patent No. 5,554,501, herein incorporated by 
reference). Suitable methods for covalently coupling oligonucleotides and proteins to a 
solid support and for directly synthesizing the oligonucleotides or proteins onto the 
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support are known to those working in the field; a summary of suitable methods can be 
found in Matson et al.,Anal. Biochem. 217:306-10, 1994. In one example, the 
oligonucleotides are synthesized onto the support using conventional chemical 
techniques for preparing oligonucleotides on solid supports (such as see PCT 
applications WO 85/01051 and WO 89/10977, or U.S. Patent No. 5,554,501, herein 
incorporated by reference). 

A suitable array can be produced using automated means to synthesize 
oligonucleotides in the cells of the array by laying down the precmrsors for the four 
bases in a predetermined pattern. Briefly, a multiple-channel automated chemical 
delivery system is employed to create oligonucleotide probe populations in parallel 
rows (corresponding in number to the number of channels in the delivery system) across 
the substrate. Following completion of oligonucleotide synthesis in a first direction, the 
substrate can then be rotated by 90° to permit synthesis to proceed within a second (2°) 
set of rows that are now perpendicular to the first set. This process creates a multiple- 
channel array whose intersection generates a plurality of discrete cells. 

The oligonucleotides can be bound to the polypropylene support by either the 3' 
end of the oligonucleotide or by the 5' end of the oligonucleotide. In one example, the 
oligonucleotides are bound to the solid support by the 3' end. However, one of skill in 
the art can determine whether the use of the 3' end or the 5' end of the oligonucleotide is 
suitable for bonding to the solid support. In general, the internal complementarity of an 
oligonucleotide probe in the region of the 3' end and the 5' end determines binding to 
the support. 

In particular examples, the oligonucleotide probes on the array include one or 
more labels, that permit detection of oligonucleotide probe :target sequence 
hybridization complexes. 
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Detection of nucleic acids and proteins 

The nucleic acids and proteins obtained from the subject may contain one or 
more insertions, deletions, substitutions, or combinations thereof in one or more genes 
associated with venous thrombosis, such as those listed in Table 1 . Such mutations or 
polymorphisms (or both) can be detected to determine if the subject has a genetic 
disposition to developing venous thrombosis. Any method of detecting a nucleic acid 
molecule or protein can be used, such as physical or functional assays. 

Methods for labeling nucleic acid molecules and proteins, such that tiiey can be 
detected, are well known. Examples of such labels include non-radiolabels and 
radiolabels. Non-radiolabels include, but are not limited to an enzyme, 
chemiluminescent compovmd, fluorescent compound, metal complex, hapten, enzyme, 
colorimetric agent, a dye, or combinations thereof. Radiolabels include, but are not 
limited to, '"I and "S. For example, radioactive and fluorescent labeling methods, as 
well as other methods known in the art, are suitable for use with the present disclosure. 
In one example, the primers used to amplify the subject's nucleic acids are labeled (such 
as with biotin, a radiolabel, or a fluorophore). In another example, the amplified nucleic 
acid samples are end-labeled to form labeled amplified material. For example, 
amplified nucleic acid molecules can be labeled by including labeled nucleotides in the 
amplification reactions. In a particular example, proteins obtained fi-om a subject are 
labeled and subsequently analyzed, for example by applying them to an array. 

The amplified nucleic acid molecules associated with venous thrombosis are 
applied to the VT detection array under suitable hybridization conditions to form a 
hybridization complex. In particular examples, the amplified nucleic acid molecules 
include a label. In one example, a pre-treatment solution of organic compoimds, 
solutions that include organic compounds, or hot water, can be, applied before 
hybridization (see U.S. Patent No. 5,985,567, herein incorporated by reference). 

Hybridization conditions for a given combination of array and target material 
can be optimized routinely in an empirical manner close to the T„ of the expected 
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duplexes, thereby maximizing the discriminating power of the method. Identification of 
the location in the array, such as a cell, in which binding occurs, permits a rapid and 
accurate identification of sequences associated with venous thrombosis present in the 
amplified material (see below). 
5 The hybridization conditions are selected to permit discrimination between 

matched and mismatched oligonucleotides. Hybridization conditions can be chosen to 
correspond to those known to be suitable in standard procedures for hybridization to 
filters and tiien optimized for use with the arrays of the disclosure. For example, 
conditions suitable for hybridization of one type of target would be adjusted for the use 

1 0 of other targets for the array. In particular, temperature is controlled to substantially 
eliminate formation of duplexes between sequences other than exactiy complementary 
VT-associated wild-type of mutant sequences. A variety of known hybridization 
solvents can be employed, the choice being dependent on considerations known to one 
of skill in the art (see U.S. Patent 5,981,185, herein incorporated by reference). 

1 5 Once the amplified nucleic acid molecules associated with venous thrombosis 

have been hybridized with the oligonucleotides present in the VT detection array, the 
presence of the hybridization complex can be analyzed, for example by detecting the 
complexes. 

Detecting a hybridized complex in an array of oligonucleotide probes has been 
20 previously described (see U.S. Patent No. 5,985,567, herein incorporated by reference). 
In one example, detection includes detecting one or more labels present on the 
oligonucleotides, the amplified sequences, or both. In particular examples, developing 
includes applying a buffer. In one embodiment, the buffer is sodium saline citrate, 
sodiimi saline phosphate, tetramethylammonium chloride, sodium saline citrate in 
25 ethylenediaminetetra-acetic, sodium saline citrate in sodiiun dodecyl sulfate, sodium 
saline phosphate in ethylenediaminetetra-acetic, sodium saline phosphate in sodium 
dodecyl sulfate,, tetramethylammonium chloride in ethylenediaminetetra-acetic, 
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tetramethylammonium chloride in sodiiim dodecyl sulfate, or combinations thereof. 
However, other suitable buffer solutions can also be used. 

Detection can further include treating the hybridized complex with a conjugating 
solution to effect conjugation or couplmg of the hybridized complex with the detection 
5 label, and treating the conjugated, hybridized complex with a detection reagent. In one 
example, the conjugating solution includes streptavidin alkaline phosphatase, avidin 
alkaline phosphatase, or horseradish peroxidase. Specific, non-limiting examples of 
conjugating solutions include streptavidin alkaline phosphatase, avidin alkaline 
phosphatase, or horseradish peroxidase. The conjugated, hybridized complex can be 

1 0 treated with a detection reagent. In one example, the detection reagent includes 

enzyme-labeled fluorescence reagents or calorimetric reagents. In one specific non- 
limiting example, the detection reagent is enzyme-labeled fluorescence reagent (ELF) 
from Molecular Probes, Inc. (Eugene, OR). The hybridized complex can then be placed 
on a detection device, such as an ultraviolet (UV) transilluminator (manufactured by 

15 UVP, Inc. of Upland, CA). The signal is developed and the increased signal intensity 
can be recorded with a recording device, such as a charge coupled device (CCD) camera 
(manufactured by Photometries, Inc. of Tucson, AZ). In particular examples, these 
steps are not performed when radiolabels are used. 

In particular examples, the method further includes quantification, for instance 

20 by determining the amount of hybridization. 

Kits 

The present disclosure provides for kits that can be used to determine whether a 
subject, such as an otherwise healthy human subject, is genetically predisposed to 
25 venous thrombosis. Such kits allow one to determine if a subject has one or more 
genetic mutations, polymorphisms, or both, in sequences associated with venous 
thrombosis, including those Usted in Table 1. 
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The disclosed kits include a binding molecule, such as an oligonucleotide probe 
that selectively hybridizes to a VT-related molecule (such as a mutant or wild-type 
nucleic acid molecule) that is the target of the kit. In one example, the kit includes the 
oligonucleotide probes shown in SEQ ID NOS: 1-302, or a subset thereof, such as even- 
5 numbered SEQ ID NOS: 2-302 or odd-numbered SEQ ID NOS: 1-301 . 

In a particular example, kits include antibodies capable of binding to wild-type 
VT-related proteins or to mutated/polymorphic proteins. Such antibodies have the 
ability to distinguish between a wdld-type and a mutant/polymorphic VT-related protein. 

The kit can further include one or more of a buffer solution, a conjugating 
1 0 solution for developing the signal of interest, or a detection reagent for detecting the 
signal of interest, each in separate packaging, such as a container. In another example, 
the kit includes a plurality of VT-related target nucleic acid sequences for hybridization 
with a VT detection array to serve as positive control. The target nucleic acid sequences 
can include oligonucleotides such as DNA, RNA, and peptide-nucleic acid, or can 
1 5 include PGR fragments. 



Venous Thrombosis Preventative Therapy 
The present disclosure also provides methods of avoiding or reducing the 
incidence of venous thrombosis in a subject determined to be genetically predisposed to 

20 developing venous thrombosis. For example, if using the screening methods described 
above a mutation or polymorphism in at least one VT-related molecule m the subject is 
detected, a treatment is selected to avoid or reduce the incidence of venous thrombosis 
or to delay the onset of venous thrombosis. The subject then can be treated in 
accordance with this selection, for example by administration of one or more 

25 anticoagulant agents. In some examples, the treatment selected is specific and tailored 
for the subject, based on the analysis of that subject's profile for one or more VT-related 
molecules. 
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The disclosure is further illustrated by the following non-limiting Examples. 
EXAMPLE 1 

Mutations and Polymorphisms Associated with Venous Thrombosis 

5 Table 1' describes VT-related nucleic acid and protein sequences used to design 

an array that allows for screening of 151 venous thrombosis-associated recurrent 
mutations and polymorphisms in eight different genes. For each potential site of 
mutation/polymorphism, two oligonucleotide probes were designed (see Example 3). 

10 Table 1: Mutations and polymorphisms associated with venous thrombosis. 



Gene 


Mutation or polymorphism* 


Factor V 


1691G/A; 1628G/A; 4070A/G; 1090A/G; 1091G/C 


Prothrombin (Factor 11) 


20210G/A 


Fibrinogen 


g chain: a(16)Arg/Cys; a(16)Arg/His; a(19)Arg/Gly; 

a(461)Lys/stop; a(554)Arg/Cys 

3 chain: P(14)Arg/Cys; P(68)Ala/Thr; p(255)Arg/Cys 

Y chain: Y(275)Arg/Cys; Y(275)Arg/His; y(292)GlyA^al; 

y(308)Asn/Lys; Y(318)Asp/Gly 

Fibrinogen gene nolvmorphism: Thr312Ala 


Protein C 


Type I PC deficiency: 41G/A: 1357C/T: 1 381 C/T: 
3mC/T; 3169T/C; 3217G/T; 3222G/A; 3222G/T; 
3359G/A; 3360C/A; 3363/4, insC; 3439C/T; 6128T/C; 
6152C/T; 6182C/T; 6216C/T; 6246G/A; 6245CA^; 
6265G/C; 6274C/T; 7176G/A; 7253C/T; 8403C/T; 
8481A/G; 8485/6 delAC or 8486/7 delCA; 8551 C/T; 
8559G/A; 8571 C/T; 8572G/A; 8589G/A; 8604G/A; 
8608C/T; 8631C/T; 8678-80 deBnt; 8689T/C; 8695C/T; 
8763G/A; 8857, delG; 8895A/C; 8924C/G 
Type II PC deficiency: 1387C/T: 1388G/A: 1432C/T: 
6218C/T; 6219G/A; 7219C/A; 8470G/A;8744G/A; 
8769C/T; 8790G/A; 8886G/A 
PC gene polymorphisms: -1654C/T; -1641 A/G 


Protem S 


Quantitative PS deficiency (type I and type IID: -34. TC 


(delG); -24, GTG/GAG; 19, GAAATAA; 26, 
GAA/GCA; 44, TA (delCTTA); 46, GTT/CTT; intron d, 
G/A, exon 4 +1; 155, AAG/GAG; 217, AAT/AGT; 238, 
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CAG/TAG; 265, TTT (ins T), 293, TCA/TGA; 295, 
GGC/GTC; intron j, G/A, exon 10 +5; 349, GAA/AAA; 
372, delCTTTTT, insAA; intron k, A/G, exon 12 -9; 
405, CTA/CCA; 410, CGA/TGA; 431, AA (insA); 465, 
TGG/TGA; 474, CGT/TGT; 522, CAG/TAG; 534, 
CTG/CGG; 625, TGT/CGT 

Qualitative PS deficiency (type ID: -2. CGT/CTT: 9. 


AAA/GAA; intron e, G/A, exon 5 +5 

Unknown type of PS deficiency: -25, CT finsTl: 467. 

GTA/GGA; 633, (delAA); 636, TAA/TAT 

PS gene polymorphisms: 35, CCG/CTG: intron b. G/A. 

exon 2 +5; 303, ATC/ATT; intron k, C/T, exon 1 1 +54; 

460, TCC/CCC; 626, CCA/CCG; exon 15, T/G 18 nt 

after the stop codon; exon 1 5, C/A 520 nt after the stop 

codon 


AT III 


Type I AT III deficiency: 2770insT, 5311-5320del6bp. 
5356-64delCTT, 5381 C/T, 5390C/T, 5493 A/G, 
6490C/T, 9788G/A, 9819C/T, 13342insA, 13380T/C 
Type II AT III deficiency: 


RS mutations: 6460A/G, 13262G/A, 13268G/C, 

13268G/T, 13295C/T, 13296G/A, 13299C/T 

HBS mutations: 2484T/A, 2586C/T, 2603 C/T, 

2604G/A, 2759C/T, 5382G/A 

PE mutations: 13324C/A, 13328G/A, 13333C/G, 

13337C/A, 13338C/T, 13392G/C 

AT III gene polymorphisms: 76bp dimorphism. 


7596G/A, 7626G/A, 7987T/C, 9893C/G 


ACE 


Intron 16, 288 bp insertion/deletion 


MTHFR 


677C/T; 1298A/C 



♦Nucleotide or amino acid number refers to the human sequence, although one skilled in the art can 
determine the corresponding nucleotide or amino acid for other organisms. 



EXAMPLE 2 

Statistical Analysis in the Prediction of Venous Thrombosis 

This example demonstrates that MERT offers a high magnitude clinical validity 
by assessing 151 alleles simultaneously in identifying individuals at very high risk of 
developing VT, even if the contribution of each allele to the risk is small and not 
enough to cause VT. 
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To demonstrate statistically that the disclosed methods can predict a healthy 
subject's probability of developing venous thrombosis, the following methods were 
used. The results described below demonstrate that disease prediction for venous 
thrombosis is greatly improved by considering multiple predisposing genetic factors 
5 concurrently. To demonstrate how concurrent screening of multiple venous thrombosis 
(VT) associated susceptibility gene defects improves the prediction of developing 
venous thrombosis, likelihood ratios for each VT associated susceptibility gene test 
were calculated by logistic regression and then the combmed likelihood ratio (LR) for 
the panel of VT associated susceptibility gene tests was calculated simply as the product 
10 of the likelihood ratios (LRs) of the individual tests assuming each test is independent. 

While the resiilts for each test were dichotomous which there were only two 
possible outcomes and were classified as "positive" vs "negative, the associated LR 
took the values LR=1 for an allele-positive test or LR=0 for an allele-negative test. 

For the calculations, 10 VT associated susceptibility alleles in eight VT 
1 5 associated genes with an established prevalence both in the general population and 
unselected VT patients were selected. 

The relevant allele frequencies were derived for FV Leiden (G 1691 A), FY 
G1628A, FV R2 allele, prothrombin G20210A, fibrinogen Thr312Ala, ACE DD and 
MTHFR C677G variants and protein C, protein S and AT III deficiencies using data 
20 from previously reported case-control studies conducted in the different ethnic 

populations regarding VT associated genetic susceptibility (Seligsohn and Lubetsky, A': 
Engl. J. Met/. 344: 1222- 1231, 2001; S,a\omon et al.,Arterioscler. Thromb. Vase. Biol. 
19:51 1-518, 1999;. De Stefano etal. Semin. Thromb. Hemost. 24:367-379, 1998; Helley 
etal. Hum. Genet. 100:245-8, 1997; Hiyoshie/a/. Thromb. Haemost. 80:705-6, 
25 1998;.Le et al. Clin. Genet. 57:296-303, 2000;. Alhenc-Gelas et al. Thromb. Haemost. 
81:193-97, 1999;. Benson a/. Thromb. Haemost. 86:1188-92, 2001; PoorteM/. Blood 
88:3698-3703, 1996; Hillarpe/a/. Thromb. Haemost. 78:990-992,1997; Ferraresi etal. 
Arterioscler. Thromb. Vase. Biol. 17:2418-2422, 1997;. Leroyere/a/. Thromb. 
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Haemost. 80:49-51, 1998; Rosendaal e/a/. Thromb Haemost 79 J06-70S, 1998; 

Ehien£oT± et al. Arterioscler. Thromb. Vase. Biol. 19:276-280, 1999;.Eichinger e/a/. 

Thromb. Haemost. 81:14-17, 1999; Tosetto etal. Thromb. Haemost. 82:1395-1398, 

1999; Souto etal Thromb. Haemost. 80:366-369, 1998;.Alhenc-Gelas etal. Thromb. 
5 Haemost. 8 1 :506-5 1 0, 1 999;.Corral et al. Br. J. Haematol. 99:304-307, 1 997;.Brown et 

al. Br. J. Haematol. 98:907-909, 1997; Hainaut etal. Acta Clin Belg 53:344-348, 1998; 

Howard era/. Blood 9\ -.1092, 1998; Ridker e/ a/. Circulation 99:999-1004,1999; 

Margaglione etal. Ann. Intern. Med. 129:89-93, 1998;. Rees etal. Br. J. Haematol. 

105:564-566, 1999; De Stefano etal. Thromb. Haemost. 80:342-343, 1998;. Liu Y 
10 et al., J. Med Genet. 38:31-5, 2001; Kain et al. Am. J. Epidemiol 156:174-9, 2002; 

Miletich et al. N. Engl J. Med.3\l .991-996, 1987; Koster et al Leiden Thrombophilia 

Study. 85:2756-2761, 1995; Ben-Tal etal, Thromb. Haemost. 61:50-54, 1989; Heijboer 

etal N. Engl J. Med 323:1512-1516, 1990;. Pabinger e/ a/.. Blood Coagul 

Fibrinolysis 3:547-553, 1992;. Melissari et al. Blood. Coagul Fibrinolysis 3:749-758, 
15 1992;. Tait et al, Thromb. Haemost. 73:87-93, 1995; Tabemero et al. Am. J. Hematol 

36:249-254, 1991; Horellou a/., 289; 1285- 1287, 1984; Gladson a/.. Thromb. 

Haemost. 59:18-22, 1988; Dykes etal, Br. J. Haematol 113:636-641; 2001; Rees et 

al, Br J Haematol 105:564-566, 1999; Harper a/. Br. J. Haemotol 77:360-364, 1991; 

DuH5ek et al. Blood Coagul Fibrinolysis 13:569-573, 2002; Bombeli et al Am. J. 
20 Hematol 70:126-132, 2002; Arruda etal, Thromb. Haemost. 77:818-821, 1997; 

Margaglione etal, Thromb. Haemost. 79:907-911, 1998; Dilley era/.. Am. J. 

Epidemiol 147:30-5, 1998; Fatini etal, Eur. J. Clin. Invest. 33:6442-647, 2003) 

(Table 2). 
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Table 2: Frequency of inherited thrombophilias among healthy subjects and 
unselected and selected patients with VT 





Healthy subjects 


Unselected patients 




# Total 


# Affected 


# Total 


# Affected 












FVgeneG1691A 
(Leiden) polymorphism 


2,192 t 


1 (0.05%)^ 




215 (18.8%) 


FVgeneG1628A 
polymorphism 


65 t 
587 J 


8(12 3%) 
354 (60.3%) 




107 (65.2%) 


FV gene R2 allele 


394 t 
472 J 

532 § 


45 (11.4%) 
48 (10.2%) 
60(11.3%) 


205 


38 (18.5%) 


Prothrombin gene 
G20210A polymorphism 


1 1,932 t 
1,811 t 


322 (2.7%) 
1 (0.06%) 


2884 


205 (7.2%) 


Fibrinogen gene 

Thr312Ala 

polymorphism 


250 1 
402 $* 


101(40.4%) 
270(67.2%) 


98 


50 (51%) 


Protein C deflciency 


15,070 


60 (0.4%) 


2008 


74 (3.7%) 


Protein S dericiency 


3,788 


5(0.13%) 


2008 


46 (2.3%) 


Antithrombin 
deflciency 


9,669 


16(0.16%) 


2254 


82 (3.6%) 


MTHFRgeneC677T 

polymorphism (TT) 


1,063 t 


137(12.9%) 


511 


111 (21.7%) 


ACE gene DO genotype 


378 t 
911* 


101 (26.7%) 
20 (22%) 


336 
39 


140(41.7%) 
17(43.6%) 



t All subjects were Caucasian; JAll subjects were African or Asian (African Americans were excluded); 
I All subjects were African American or Asian; §A11 subjects were Hispanic; J* All subjects were Asian; 
J* All subjects were African-American. 



Calculations were done for the panel of the selected 10 VT associated 
susceptibility screening tests, which can concisely be described by v/ (v/^ugi^, v2=gi628a> 

Thr312Ala, ^6=proteinC' ^ 7=protein S' ^5=antithrombin, VP=ACEDD V70=mtofr). 

The LR calculations were performed by logistic regression, which is a variation 
of ordinary regression and useful when the observed test outcome is dichotomous or 
restricted to two values which usually represent the occurrence or non-occurrence of 
some outcome event like in our model. By treating the data retrieved from the 
previously reported case control studies regarding VT genetic susceptibility in different 
ethnic populations as a valid estimate of the risk odds ratio, LR for each allele positive 
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test was calculated by exponentiation of the result obtained by using the equation below 
(Albert A, Clin Chem.. 28:1 1 13-1 119, 1982; McCullagh P and Nelder JA, Chapman 
and Hall, London, 1989; Yang Q, etal.,Am. J. Hum. Genet., 72:636-649, 2003): 

hi LR(v/ ) = hi (Nco/Nca) + a +P v/ =a* + p v/ 
where a and P are the intercept term and logistic regression coefficient of the odds of 
developmg VT, respectively; NcaIs the number of cases and Ncois the number of 
control subjects derived from the previously reported data for each VT associated gene 
allele and a* is the adjusted intercept term = . a + In (Nco/Nca). 

The posterior probability of venous thrombosis (the probability of developing 
venous thrombosis) was determined for the individuals with allele-positive test results 
for each genetic test (also known as positive predictive value of each genetic test) by the 
equation: 

positive I VT) = myipositive)2£ni. , 

[1 - P(VT)] + LKiyi positive )P(VT) 
where vi positive is the allele positive test result for each VT associated genetic 
susceptibility screening test test, P (yi positive I VT) is the positive predictive value of 
the genetic test or the probability that VT will develop in people with allele positive 
result, LR(v positive) is the likelihood ratio of allele-positive test result and P (VT) is 
the pretest risk of venous thrombosis or the overall incidence of venous thrombosis in 
the general population, which has been estimated to be 1 per 1,000 persons in the 
United States. Calcvdated likelihood ratios and positive predictive values for each 
venous thrombosis associated susceptibility gene test were demonstrated in Table 3. 
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Table 3. Likelihood ratios and Positive predictive values of single susceptibility 
genes and multiple genetic screening with MERT for developing VT in healthy 
subjects 



Single susceptibility test analysis 


Likelihood Ratio 


Posterior 






probability of 






developing VT 


Factor V gene 






G 1691 A (Leiden) polymorphism 


3.9 


0.39% 


G 1628 A polymorphism 


1.1 


0.11% 


R2 allele 


1.6 


0.16% 


Prothrombin gene G20210A 


2.6 


0.26 % 


polymorphism 






Fibrinogen Thr3 1 2Ala polymorphism 


1.26 


0.126% 


Protein C deficiency 


9.3 


0.93 % 


Protein S deficiency 


17 


1.7% 


Antithrombin deficiency 


22 


2.2% 


ACE DD genotype 


1.56t 


0.156% 




1.98§ 


0.198% 


MTHFR gene C677T polymorphism (TT) 


1.7t 


0.17% 


Concurrent screening of 8 genes with 


214110.3t 


99.5% 


MERT 


7713.31 


88.5% 




110189§ 


91.7% 



t Caucasian populations; { Asian populations; § African populations 



Then, assuming that the effect of each of the genetic defects in the eight different 
genes is independent and that all interactive effects are purely multiplicative, the LR 
was calculated for the panel of VT associated genetic susceptibility tests by the equation 
that stating the LR for a panel of independent tests is simply the product of the 
1 0 likelihood ratios of the individual test results (Yang Q, et al. ,Am.J. Hum. Genet. , 
72:636-649, 2003): 

LR(V) = LR(v;) LR(v2) LR(vj) LR(v^) LR(vj) LR(v5) LR(v7) LR(v5) LR(vp) LR(yio) 
where LR(V) is the likelihood ratio for the panel of selected 10 independent VT 
associated genetic susceptibility tests. 
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As shown in Table 3, whereas each genetic test provides limited predictive 
information about the probability of developing venous thrombosis (the posterior 
probabilities of disease range from 0. 1 1 % to 2.2 % for each test alone), the posterior 
probability of venous thrombosis occurring increases to 99.5 % when estimated with 
5 unselected patients for Caucasians and 88.5 % for Asians and 9 1 .7 % for African 
populations by using the disclosed methods, an increase of > 40-fold. 

EXAMPLES 

Array for Detecting Susceptibility to Venous Thrombosis 

10 For each potential site of mutation/polymorphism (Table 1), two oligonucleotide 

probes were designed (SEQ ID NOS 1-302). The first is complementary to the wild 
type sequence (odd numbers of SEQ ID NOS: 1-301) and the second is complementary 
to the mutated sequence (even numbers of SEQ ID NOS: 2-302). For example, SEQ ID 
NO: 1 is complementary to a wild-type FV sequence, which can be used to detect the 

1 5 presence of a "G" at nucleotide 1 69 1 , and SEQ ID NO: 2 is complementary to a mutant 
FV sequence, which can be used to detect the presence of a "A" at nucleotide 1691 . 
The disclosed oligonucleotide probes can further include one or more detectable labels, 
to permit detection of hybridization signals between the probe and a target sequence. 

Compilation of "loss" and "gain" of hybridi2ation signals will reveal the genetic 

20 status of the individual with respect to the 1 5 1 known VT-associated recurrent defects. 

EXAMPLE 4 
Nucleic Acid-Based Analysis 
The VT-related nucleic acid molecules provided herein can be used in methods 
25 of genetic testing for predisposition to venous thrombosis owing to VT-related nucleic 
acid molecule polymorphism/mutation in comparison to a wild-type nucleic acid 
molecule. For such procedures, a biological sample of the subject is assayed for a 
polymorphism or mutation (or both) in a VT-related nucleic acid molecule, such as 
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those listed in Table 1. Suitable biological samples include samples containing genomic 
DNA or RNA (including mRNA) obtained from cells of a subject, such as those present 
in peripheral blood, urine, saliva, tissue biopsy, surgical specimen, amniocentesis 
samples and autopsy material. 

The detection in the biological sample of a polymorphism/mutation in one or 
more VT-related nucleic acid molecules, such as those listed in Table 1, can be achieved 
by methods such as hybridization using allele specific oligonucleotides (ASOs) 
(Wallace et al, CSHL Symp. Quant. Biol. 51:257-61, 1986), direct DNA sequencing 
(Church and Gilbert Proc. Natl. Acad. Sci. USA 81:1991-1995, 1988), the use of 
restriction enzymes (Flavell et al, Cell 15:25, 1978; Geever et al, 1981), discrimination 
on the basis of electrophoretic mobility in gels with denaturing reagent (Myers and 
Maniatis, Cold Spring Harbor Symp. Quant. Biol. 51 :275-284, 1986), RNase protection 
(Myers et al, Science 230:1242, 1985), chemical cleavage (Cotton et al, Proc. Natl 
Acad. Set USA 85:4397-4401, 1985), and the ligase-mediated detection procedure 
(Landegren et al. Science 241:1077, 1988). 

Oligonucleotides specific to wild-type or mutated VT-related sequences can be 
chemically synthesized using commercially available machines. These oligonucleotides 
can then be labeled, for example with radioactive isotopes (such as ^^P) or with non- 
radioactive labels such as biotin (Ward and Langer et al, Proc. Natl Acad. ScL USA 
78:6633-6657, 1981) or a fluorophore, and hybridized to individual DNA samples 
immobilized on membranes or other solid supports by dot-blot or transfer from gels 
after electrophoresis. These specific sequences are visualized, for example by methods 
such as autoradiography or fluorometric (Landegren et al. Science 242:229-237, 1989) 
or colorimetric reactions (Gebeyehu et al. Nucleic Acids Res. 15:4513-4534, 1987). 
Using an ASO specific for a wild-type allele, the absence of hybridization would 
indicate a mutation or polymorphism in the particular region of the gene. In contrast, if 
an ASO specific for a mutant allele hybridizes to a clinical sample then that would 
indicate the presence of a mutation or polymorphism in the region defmed by the ASO. 
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EXAMPLE 5 
Protein-Based Analysis 
This example describes methods that can be used to detect defects in an 
5 amount of a VT-related protein, or to detect changes in the amino acid sequence itself 
VT-related protein sequences can be used in methods of genetic testing for 
predisposition to venous thrombosis owing to VT-related protein polymorphism or 
mutation (or both) in comparison to a wild-type protein. For such procedures, a 
biological sample of the subject is assayed for a polymorphism/mutation in a VT-related 
1 0 protein, such as those listed in Table 1 . Suitable biological samples include samples 
containing protein obtained from cells of a subject, such as those present in peripheral 
blood, urine, saliva, tissue biopsy, surgical specimen, anuiiocentesis samples and 
autopsy material. 

A decrease in the amount of one or more VT-related proteins in a subject, can 
1 5 indicate that the subject has an increased susceptibility to developing VT. Similarly, the 
presence of one or more mutations/polymorphisms in a VT-related protein in 
comparison to a wild-type protein, can indicate that the subject has an increased 
susceptibility to developing VT. 

The determination of reduced VT-related protein levels, in comparison to such 
20 expression in a normal subject (such as a subject not predisposed to developing VT), is 
an alternative or supplemental approach to the direct determination of the presence of 
VT-related nucleic acid mutations/polymorphisms by the methods outlined above. The 
availability of antibodies specific to particular VT-related protein(s) will facilitate the 
detection and quantitation of cellular VT-related protein(s) by one of a niomber of 
25 immimoassay methods which are well known in the art, such as those presented in 
Harlow and Lane (Antibodies. A Laboratory Manual, CSHL, New York, 1988). 
Methods of constructing such antibodies are known in the art. 
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The determination of the presence of one or more mutations/polymorphisms in a 
VT-related protein, in comparison to a wild-type VT-related protein, is another 
alternative or supplemental approach to the direct determination of the presence of VT- 
related nucleic acid mutations/polymorphisms by the methods outlined above. 
5 Antibodies that can distinguish between a mutant/polymorphic protein and a wild-type 
protein, can be prepared using methods knovra in the art. 

Any standard immunoassay format (such as ELISA, Western blot, or RIA assay) 
can be used to measure VT-related polypeptide or protein levels, and to detect 
mutations/polymorphisms in VT-related proteins. A comparison to wild-type (normal) 

1 0 VT-related protein levels, and a decrease in VT-related polypejptide levels is indicative 
of predisposition to developing VT. Similarly, the presence of one or more 
mutant/polymorphic VT-related proteins is indicative of predisposition to developing 
VT. Immunohistochemical techniques can also be utilized for VT-related polypeptide 
or protein detection and quantification. For example, a tissue sample can be obtained 

15 from a subject, and a section stained for the presence of a wild-type or 

polymorphic/mutant VT-related protein using the appropriate VT-related protein 
specific binding agents and any standard detection system (such as one that includes a 
secondary antibody conjugated to horseradish peroxidase). General guidance regarding 
such techniques can be found in Bancroft and Stevens (Theory and Practice of 

20 Histological Techniques, Churchill Livingstone, 1982) and Ausubel et al. {Current 
Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 

For the purposes of quantitating a VT-related protein, a biological sample of the 
subject, which sample includes cellular proteins, can be used. Quantitation of a HRPC- 
related protein can be achieved by immunoassay and the amount compared to levels of 

25 the protein foimd in cells from a subject not genetically predisposed to developing VT. 
A significant decrease in the amount of one or more VT-related proteins in the cells of a 
subject compared to the amount of the same VT-related protein found in normal human 
cells is usually about a 30% or greater difference. Substantial underexpression of one or 
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more VT-related protein(s), may be indicative of a genetic predisposition to developing 
VT. 

EXAMPLE 6 
5 Kits 

Kits are provided to determine whether a subject has one or more 
polymorphisms, mutations, or both, in a VT-related nucleic acid sequence (such as kits 
containing VT detection arrays). Kits are also provided that contain the reagents need to 
detect hybridization complexes formed between oligonucleotides on an array and VT- 
10 related nucleic acids amplified from a subject. These kits can each include instructions, 
for instance instructions that provide calibration curves or charts to compare with the 
determined (such as experimentally measured) values. 

In one example, the kit includes primers capable of amplifying VT-related 
nucleic acid molecules, such as those listed in Table 1 . In particular examples, the 
15 primers are provided suspended in an aqueous solution or as a freeze-dried or 

lyophilized powder. The container(s) in which the primers are supplied can be any 
conventional container that is capable of holding the supplied form, for instance, 
microfuge tubes, ampoules, or bottles. In some applications, pairs of primers are be 
provided in pre-measured single use amoimts in individual, typically disposable, tubes, 
20 or equivalent containers. 

The amount of each primer supplied in the kit can be any amoimt, depending for 
instance on the market to which the product is directed. For instance, if the kit is 
adapted for research or clinical use, the amoimt of each oligonucleotide primer provided 
likely would be an amount sufficient to prime several in vitro amplification reactions. 
25 Those of ordinary skill in the art know the amount of oligonucleotide primer that is 
appropriate for use in a single amplification reaction. General guidelines may for 
instance be found in Innis et al. (PCR Protocols, A Guide to Methods and Applications, 
Academic Press, Inc., San Diego, CA, 1990), Sambrook etal. (In Molecular Cloning: 
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A Laboratory Manual. Cold Spring Harbor, New York, 1989), and Ausubel et al. (In 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1998). 

In particular examples, a kit includes an array with oligonucleotides that 
recognize wild-type, mutant/polymorphic, or both, VT-related sequences, such as those 
listed in Table 1 . The array can include other oligonucleotides, for example to serve as 
negative or positive controls. The oligonucleotides that recognize the wild-type and 
mutant sequences can be on the same array, or on different arrays. A particular array is 
disclosed in Example 3. For example, the kit can include the oligonucleotides shown in 
SEQ ID NOS: 1-302, or subsets thereof, such as at least 10 of the oligonucleotides 
shown in SEQ ID NOS: 1-302, for example at least 20, at least 50, at least 100, at least 
151, or even at least 300 of the oligonucleotides shown in SEQ ID NOS: 1-302. In a 
particular example, an array includes either the odd-numbered SEQ ID NOS: 1-301 (i.e. 
SEQ ID NOS: 1, 3, 5, 7, etc.), or the even-numbered SEQ ID NOS: 2-302 (i.e. SEQ ID 
NOS: 2, 4, 6, 8, etc.). However, both such arrays can be included in a single kit. 

In some examples, kits further include the reagents necessary to carry out 
hybridization and detection reactions, including, for instance appropriate buffers. 
Written instructions can also be included. 

Kits are also provided for the detection of VT-related protein expression, for 
instance under expression of a protein encoded for by a nucleic acid molecule listed in 
Table 1 . Such kits include one or more wild-type or mutant factor V (FV), prothrombin 
(factor II), fibrinogen, protein C, protein S, AT III, ACE, or MTHFR proteins (full- 
length, fragments, or fusions) or specific binding agent (such as a polyclonal or 
monoclonal antibody or antibody fi-agment), and can include at least one control. The 
VT-related protein specific binding agent and control can be contained in separate 
containers. The kits can also include a means for detecting VT-related protein:agent 
complexes, for instance the agent may be detectably labeled. If the detectable agent is 
not labeled, it can be detected by second antibodies or protein A, for example, either of 
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both of which also can be provided in some kits in one or more separate containers. 
Such techniques are well known. 

Additional components in some kits include instructions for carrying out the 
assay. Instructions permit the tester to determine whether VT-linked expression levels 
5 are reduced in comparison to a control sample. Reaction vessels and auxiliary reagents 
such as chromogens, buffers, enzymes, etc. can also be included in the kits. 

In view of the many possible embodiments to which the principles of our 
invention may be applied, it should be recognized that the illustrated embodiment is 
1 0 only a preferred example of the invention and should not be taken as a limitation on the 
scope of the invention. Rather, the scope of the invention is defined by the following 
claims. We therefore claim as our invention all that comes within the scope and spirit 
of these claims. 
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We claim: 

1 . A method of detecting a genetic predisposition to venous thrombosis (VT) in a subject, 

comprising: 

determining whether the subject has one or more mutations, polymorphisms, or both 
(mutations/polymorphisms) in at least eight VT-related molecules, wherein the at least eight 
venous thrombosis molecules comprise factor V (FV), prothrombin (factor II), fibrinogen, 
'protein C, protein S, antithrombin III (AT III), angiotensin 1 -converting enzyme (ACE), and 
methylenetetrahydrofolate reductase (MTHFR), wherein the presence of one or more 
mutations/polymorphisms indicates that the subject has a genetic predisposition for venous 
thrombosis. 

2. The method of claim 1, wherein the one or more mutations/polymorphisms comprise one or 
more mutations/polymorphisms listed in Table 1, 

3. The method of claim 1, wherein the method comprises determining whether the subject has 
one or more mutations in at least 10 of the mutations/polymorphisms listed in Table 1 . 

4. The method of claim 1, wherein the method comprises determining whether the subject has 
one or more mutations in at least 50 of the mutations/polymorphisms listed in Table 1 . 

5. The method of claim 1, wherein the method comprises determining whether the subject has 
one or more mutations/polymorphisms in at least 151 of the mutations/polymorphisms listed in 
Table 1. 

6. The method of claim 1, wherein the method comprises determining whether the subject has 
one or more mutations/polymorphisms in no more than 10 of the mutations/polymorphisms 
listed in Table 1. 
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7. The method of claim 2, wherein the one or more mutations/polymorphisms comprise FV 
Leiden polymorphism (G 1691 A), FV G1628 polymorphism, FV A4070G polymorphism, 
prothrombin G20210A polymorphism, fibrinogen Thr312Ala polymorphism, PC deficiency, PS 
deficiency, AT III deficiency, ACE intron 16, 288 bp insertion/deletion, and MTHFR C677T 
polymorphism. 

8. The method of claim 1, wherein the method provides a probability of developing VT of at 
least 98% in Caucasians, at least 88% in Asians, and at least 90% in Afiicans. 

9. The method of claim 1 , wherein the method comprises determining whether the subject has 
one or more mutations/polymorphisms in at least eight VT-related molecules. 

10. The method of claim 1, wherein the at least eight VT-related molecules comprise nucleic 
acid molecules. 

1 1 . The method of claim 10, wherein the nucleic acid molecules are amplified fi-om the subject, 
thereby generating amplification products, and wherein the amplification products are hybridized 
with oligonucleotide probes that detect the one or more mutations/polymorphisms. 

12. The method of claim 1 1, wherein hybridizing the oligonucleotides comprises: 

incubating the amplification products with the oligonucleotide probes for a time sufficient 
to allow hybridization between the amplification products and oligonucleotide probes, thereby 
forming amplification products: oUgonucleotide probe complexes; and 

analyzing the amplification products: oligonucleotide probe complexes to determine if 
the amplification products comprise one or more mutations/polymorphisms in the VT-related 
nucleic acids, wherein the presence of one or more mutations/polymorphisms indicates that the 
subject has a genetic predisposition for VT. 
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13. The method of claim 12, wherein analyzing the amplification products: oligonucleotide probe 
complexes comprises determining an amoimt of nucleic acid hybridization, and wherein a greater 
amount of hybridization to one or more of the mutated sequences, as compared to an amount of 
hybridization to a corresponding wild-type sequence, indicates that the subject has a genetic 
predisposition for VT. . 

14. The method of claim 12, wherein analyzing the amplification products:oligonucleotide probe 
complexes includes detecting and quantifying the complexes. 

15. The method of claim 11, wherein the oligonucleotide probes are present on an array 
substrate. 

16. The niethod of claim 15, wherein the array fiirther comprises oligonucleotide probes 
complementary to wild-type VT-related nucleic acid molecules. 

17. The method of claim 16, wherein the wild-type VT-related nucleic acid molecules comprise 
oligonucleotide probes complementary to wild-type factor V, wild-type factor II, wild-type 
fibrinogen, wild-type protein C, wild-type protein S, wild-type AT III, wild-type ACE, and wild- 
type MTHFR nucleic acid sequences. 

18. The method of claim 1, wherein the at least eight VT-related molecules consist of sequences 
from factor V, factor II, fibrinogen, protein C, protein S, AT III, ACE, and MTHFR. 

19. The method of claim 1, wherein the subject is m a group potentially at risk of developing a 
venous thrombosis. 

20. The method of claim 19, wherein the subject is pregnant, is in puerperium, is using oral 
contraceptives or hormone replacement therapy, has previous thrombosis history, has or will 
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undergo prolonged immobilization, has a myeloproliferative disorder, has a malignancy, has or 
will undergo surgery, has a bone fracture, is of advanced age, has antiphospholipid antibodies, or 
combinations thereof. 

21. The method of claim 11, wherein the nucleic acid molecules obtained from the subject are 
obtained from serum. 

22. A method of detecting a genetic predisposition to VT in a subject, comprising: 

applying amplification products to an array, wherein the array comprises oligonucleotide 
probes complementary to mutated factor V, mutated factor II, mutated fibrinogen, mutated 
protein C, mutated protein S, mutated AT III, mutated ACE, and mutated MTHFR sequences, 
and wherein the amplification products comprise nucleic acid sequences from factor V, factor II, 
fibrinogen, protein C, protein S, AT III, ACE, and MTHFR obtained from the subject; 

incubating the amplification products with the array for a time sufficient to allow 
hybridization between the amplification products and oligonucleotide probes, thereby forming 
amplification products: oligonucleotide probe complexes; and 

analyzing the amplification products: oligonucleotide probe complexes to determine if 
the amplification products comprise one or more mutations/polymorphisms in the factor V, 
factor II, fibrinogen, protein C, protein S, AT III, ACE, or MTHFR sequences, wherein the 
presence of one or more mutations/polymorphisms indicates that the subject has a genetic 
predisposition for VT. 

23. A method of selecting a venous thrombosis (VT) therapy, comprising: 

detecting a mutation/polymorphism in at least one VT-related molecule of a subject, 
using the method of claim 1 ; and 

if such mutation/polymorphism is identified, selecting a treatment to avoid or reduce VT, 
or to delay tiie onset of VT. 
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24. The method of claim 23, further comprising administering the selected treatment to the 
subject. 

25. The method of claim 24, wherein the selected treatment comprises treating the subject with 
an anticoagulant agent. 

26. An array comprising oligonucleotide probes complementary to wild-type gene sequences, 
mutated gene sequences, or both, wherein the gene sequences comprise coding or non-coding 
sequences from factor V, factor II, fibrinogen, protein C, protein S, AT III, ACE, and MTHFR 
genes. 

27. The array of claim 26, wherein the mutated gene sequences comprise ten or more 
mutations/polymorphisms listed in Table 1. 

28. The array of claim 27, wherein the mutated gene sequences consist essentially of the 
mutations/polymorphisms listed in Table 1. 

29. A method of detecting a genetic predisposition to venous thrombosis (VT) in a subject, 
comprising: 

applying amplification products to the array of claim 13, wherein the amplification 
products comprise amplified nucleic acids obtained from the subject, wherein the nucleic acids 
comprise coding or non-coding sequences from factor V, factor II, fibrinogen, protein C, protein 
S, AT III, ACE, and MTHFR; 

incubating the amplification products with the array for a time sufficient to allow 
hybridization between the amplification products and oligonucleotide probes, thereby forming 
amplification products: oligonucleotide probe complexes; and 

analyzing the amplification products: oligonucleotide probe complexes to determine if 
the amplification products comprise one or more mutations/polymorphisms in the factor V, 
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factor II, fibrinogen, protein C, protein S, AT III, ACE, or MTHFR genes, wherein the presence 
of one or more mutations/polymorphisms indicates that the subject has a genetic predisposition 
forVT. 

30. A kit for detecting a genetic predisposition to venous thrombosis (VT) in a subject, 
comprising: 

a solid phase nucleic acid array comprising a plurality of oligonucleotide probes 
chemically linked to a solid polymeric support surface in a predetermined pattern, wherein the 
oligonucleotide probes are capable of hybridizing under stringent conditions to one or more 
nucleic acid molecules having VT-related mutations/polymorphisms in factor V, factor II, 
fibrinogen, protein C, protein S, AT III, ACE, and MTHFR genes. 

31. The kit of claim 30, wherein the ohgonucleotides comprise SEQ ID NOS: 1-302. 

32. The kit of claim 30, fiirther comprising primers for amplifying nucleic acid molecules 
obtained from the subject to obtain amplification products, in separate packaging, wherein the 
amplification products comprise sequences fi-om factor V, factor II, fibrinogen, protein C, protein 
S, AT III, ACE, and MTHFR genes; 

33. The kit of claim 30, fiirther comprising an amplification enzyme, in separate packaging. 

34. The kit of claim 30, fiirther comprising a buffer solution, in separate packaging. 

35. The kit of claim 30, wherein the array fiirther comprises oligonucleotides capable of 
hybridizing under stringent conditions to a wild-type factor V, wild-type factor II, wild-type 
fibrinogen, wild-type protein C, wild-type protein S, wild-type AT III, wild-type ACE, and wild- 
type MTHFR. 
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ABSTRACT 

METHOD EVOLVED FOR RECOGNITION OF THROMBOPHILIA (MERT) 
Methods for predicting an individual's risk for developing venous thrombosis is 
disclosed, as are arrays and kits which can be used to practice the method. The method includes 
screening for mutations, polymorphisms, or both, in at least eight venous thrombosis-related 
molecules, such as factor V, prothrombin (factor II), fibrinogen, protein C, protein S, 
antithrombin III, angiotensin I-converting enzyme (ACE), and methylenetetrahydrofolate 
reductase (MTHFR) molecules which are associated with venous thrombosis. 
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<110> Dogulu, Cigdem 
Rennert , Owen 
Chan, Wai-Yee 

<120> Method evolved for recognition of thrombophilia 
<130> 66342 
<160> 302 

<170> Patentin version 3.2 

<210> 1 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 1 

ctggacaggc gaggaataca g 



<210> 2 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 2 

ctggacaggc aaggaataca g 



<210> 3 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 3 

gacatcatga gagacatcgc c 



<210> 4 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 4 

gacatcatga aagacatcgc c 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 5 

gacctcagcc atacaaccct t 



<210> 6 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 6 

gacctcagcc gtacaaccct t 



<210> 7 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 7 

aaagaaaacc aggaatctta a 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 8 

aaagaaaacc gggaatctta a 



<210> 9 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 9 

aagaaaacca ggaatcttaa g 



<210> 10 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 10 

aagaaaacca cgaatcttaa g 



<210> 11 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 11 

gactctcagc gagcctcaat g 



<210> 12 

<211> 21 

<212> DNA 

<213> Artificial secjuence 
<220> 

<223> oligonucleotide probe 

<400> 12 

gactctcagc aagcctcaat g 



<210> 13 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 13 

aggaggcgtg cgtggcccaa g 



<210> 14 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 14 

aggaggcgtg tgtggcccaa g 



<210> 15 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 15 

ggaggcgtgc gtggcccaag g 



<210> 16 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 16 

ggaggcgtgc atggcccaag g 



<210> 17 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 17 

gcgtggccca agggttgtgg a 



<210> 18 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 18 

gcgtggccca ggggttgtgg a 



<210> 19 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 19 

agaagttacc aaagaagtgg t 



<210> 20 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 20 

agaagttacc taagaagtgg t 



<210> 21 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 21 

attcccttcc cgtggtaaat c 



<210> 22 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 22 

attcccttcc tgtggtaaat c 



<210> 23 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 23 

cttcagtgcc cgtggtcatc g 



<210> 24 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 24 

Cttcagtgcc tgtggtcatc g 



<210> 25 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 25 

ctgtcttcac gctgacccag a 



<210> 26 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 26 

ctgtcttcac actgacccag a 



<210> 27 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 27 

gattcagaac cgtcaagacg g 



<210> 28 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 28 

gattcagaac tgtcaagacg g 



<210> 29 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 29 

tgacaagtac cgcctaacat a 



<210> 30 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 30 

tgacaagtac tgcctaacat a 



<210> 31 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 31 

gacaagtacc gcctaacata t 



<210> 32 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 32 

gacaagtacc acctaacata t 



<210> 33 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 33 

gcctttgatg gctttgattt t 



<210> 34 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 34 

gcctttgatg tctttgattt t 



<210> 35 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 35 

catcccataa tggcatgcag t 



<210> 36 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 36 

catcccataa gggcatgcag t 



<210> 37 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 37 

tgggacaatg acaatgataa g 



<210> 38 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 38 

tgggacaatg gcaatgataa g 



<210> 39 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 39 

acctggaagt actggaagct g 



<210> 40 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 40 

acctggaagt gctggaagct g 



<210> 41 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 41 

gtggccacct ggggaatttc c 



<210> 42 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 42 

gtggccacct agggaatttc c 



21 



<210> 43 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 43 

cagcagcgag cgtgcccacc a 21 



<210> 44 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<210> 45 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 45 

gctgcggatc cgcaaacgtg c 21 



<210> 46 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 44 

cagcagcgag tgtgcccacc a 



21 



<400> 46 

gctgcggatc tgcaaacgtg c 



21 



<210> 47 
<211> 21 
<!212> DNA 



<213> Artificial sequence 



10 



<220> 

<223> oligonucleotide probe 
<400> 47 

cttggtcttg cccttggagc a 



<210> 48 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 48 

cttggtcttg tccttggagc a 



<210> 49 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 49 

catcggcagc ttcagctgcg a 



<210> 50 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 50 

catcggcagc ctcagctgcg a 



<210> 51 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 51 

ctgccagcgc ggtgaggggg a 



<210> 52 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 52 

ctgccagcgc tgtgaggggg a 



<210> 53 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 53 

agcgcggtga gggggagagg t 



<210> 54 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 54 

agcgcggtga aggggagagg t 



<210> 55 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 55 

agcgcggtga gggggagagg t 



<210> 56 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 56 

agcgcggtga tggggagagg t 



<210> 57 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 57 

aacggcggct gcacgcatta c 



<210> 58 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 58 

aacggcggct acacgcatta c 



<210> 59 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 59 

acggcggctg cacgcattac t 



<210> 60 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 60 

acggcggctg aacgcattac t 



<210> 61 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 61 

ggctgcacgc attactgcct a 



<210> 62 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 62 

gctgcacgcc attactgcct a 



<210> 63 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 63 

cgacctcctg cagtgtcacc c 



<210> 64 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 64 

cgacctcctg tagtgtcacc c 



<210> 65 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 65 

ctcagtgaag ttcccttgtg g 



<210> 66 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 66 

ctcagtgaag ctcccttgtg g 



<210> 67 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 67 

gccctggaag cggatggaga a 



<210> 68 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 68 

gccctggaag tggatggaga a 



<210> 69 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 69 

tcacctgaaa cgagacacag a 



<210> 70 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 70 

tcacctgaaa tgagacacag a 



<210> 71 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 71 

caagtagatc cgcggctcat t 



<210> 72 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> oligonucleotide probe 

<400> 72 

caagtagatc tgcggctcat t 



<210> 73 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 73 

gatgaccagg cggggagaca g 



<210> 74 

<211> 21 

<212> DNA 

<213> Artificial secjuence 
<220> 

<223> oligonucleotide probe 

<400> 74 

gatgaccagg tggggagaca g 



<210> 75 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 75 

atgaccaggc ggggagacag c 



<2X0> 76 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 76 

atgaccaggc agggagacag c 



<210> 77 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 77 

gcccctggca ggtgggaggc g 



<210> 78 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 78 

gcccctggca cgtgggaggc g 



<210> 79 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 79 

aggtgggagg cgaggcagca c 



<210> 80 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 80 

aggtgggagg tgaggcagca c 



<210> 81 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 81 

ctggcctgcg gggcagtgct c 



<210> 82 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 82 

ctggcctgcg aggcagtgct c 

<210> 83 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> oligonucleotide probe 

<400> 83 

ccttgtcagg cttggtatgg g 



<210> 84 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 84 

ccttgtcagg tttggtatgg g 



<210> 85 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 85 

tgacctgcgg cgctgggaga a 



<210> 86 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 86 

tgacctgcgg tgctgggaga a 



<210> 87 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 87 

caccaccgac aatgacatcg c 

<210> 88 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 88 

caccaccgac gatgacatcg c 



<210> 89 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 89 

accaccgaca atgacatcgc a 



:210> 90 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 90 

aagagcacca ccgacaatga t 



<210> 91 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 91 

atctgcctcc cggacagcgg c 



<210> 92 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 92 

atctgcctcc tggacagcgg c 



<210> 93 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 93 

cccggacagc ggccttgcag a 



<210> 94 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 94 

cccggacagc agccttgcag a 



<210> 95 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 95 

ccttgcagag cgcgagctca a 



<210> 96 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 96 

ccttgcagag tgcgagctca a 



<210> 97 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 97 

cttgcagagc gcgagctcaa t 



<210> 98 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 98 

cttgcagagc acgagctcaa t 



<210> 99 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 99 

caatcaggcc ggccaggaga c 
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<210> 100 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 100 

caatcaggcc agccaggaga c 



<210> 101 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 101 

ggagaccctc gtgacgggct g 



<210> 102 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 102 

ggagaccctc atgacgggct g 



<210> 103 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 103 

accctcgtga cgggctgggg c 



<210> 104 

<211> 21 

<212> DNA 

<213> Artificial sequence 



22 



<220> 

<223> oligonucleotide probe 
<400> 104 

accctcgtga tgggctgggg c 



<210> 105 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 105 

ccacagcagc cgagagaagg a 



<210> 106 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 106 

ccacagcagc tgagagaagg a 



<210> 107 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 107 

cttcatcaag attcccgtgg t 



<210> 108 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 108 

caacttcatg attcccgtgg t 



<210> 109 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 109 

aagattcccg tggtcccgca c 



<210> 110 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 110 

aagattcccg cggtcccgca c 



<210> 111 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 111 

cccgtggtcc cgcacaatga g 



<210> 112 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 112 

cccgtggtcc tgcacaatga g 



<210> 113 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 113 

tgctgtgtgc gggcatcctc g 



<210> 114 

<2H> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 114 

tgctgtgtgc aggcatcctc g 



<210> 115 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 115 

gtgagctggg gtgagggctg t 



<210> 116 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 116 

gtgagctggg tgagggctgt g 



<210> 117 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 117 

cggcgtttac accaaagtca g 



<210> 118 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 118 

cggcgtttac cccaaagtca g 
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<210> 119 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 119 

tcgactggat ccatgggcac a 

<210> 120 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 120 

tcgactggat gcatgggcac a 



<210> 121 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 121 

gatccgcaaa cgtgccaact c 



<210> 122 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 122 

gatccgcaaa tgtgccaact c 



<210> 123 

<211> 21 

<212> DNA 

<213> Artificial sequence 



26 



<220> 

<223> oligonucleotide probe 
<400> 123 

atccgcaaac gtgccaactc c 



<210> 124 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 124 

atccgcaaac atgccaactc c 



<210> 125 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 125 

cagcctggag cgggagtgca t 



<210> 126 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 126 

cagcctggag tgggagtgca t 



<210> 127 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 127 

agtagatccg cggctcattg a 



:210> 128 



<211> 
<212> 
<213> 



21 
DNA 

Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 128 

agtagatccg tggctcattg a 21 



<210> 129 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 129 

gtagatccgc ggctcattga t 



<210> 130 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 130 

gtagatccgc agctcattga t 



<210> 131 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 131 

cagcggccca ctgcatggat g 



<210> 132 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 132 

cagcggccca atgcatggat g 
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<210> 133 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 133 

tacagcaaga gcaccaccga c 21 



<210> 134 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 134 

tacagcaaga acaccaccga c 



<210> 135 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 135 

ctgagaacat gctgtgtgcg g 



<210> 136 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 136 

ctgagaacat actgtgtgcg g 



<210> 137 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



29 



<400> 137 

cctcggggac cggcaggatg c 
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<210> 138 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 138 

cctcggggac tggcaggatg c 

<210> 139 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 139 

ctgcgagggc gacagtgggg g 



<210> 140 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 140 

ctgcgagggc aacagtgggg g 



<210> 141 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 141 

tcacaactac ggcgtttaca c 



<210> 142 

<211> 21 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> oligonucleotide probe 
<400> 142 

tcacaactac agcgtttaca c 



<210> 143 
<211> 21 
<212> DNA 

<213> - Artificial sec[uence 

<220> 

<223> oligonucleotide probe 
<400> 143 

ccctgctgga tggcatcctt g 



<210> 144 

<211> 21 

<212> DNA 

<213> Artificial secjuence 
<220> 

<223> oligonucleotide probe 

<400> 144 

ccctgctgga cggcatcctt g 



<210> 145 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 145 

catccttggt aggcagaggt g 



<210> 146 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 146 

catccttggt gggcagaggt g 



<210> 147 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 147 

ggtgggcgct gcggggcgct g 



<210> 148 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 148 

ggtgggcgct cggggcgctg c 



<210> 149 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 149 

ctcctcctag tgcttcccgt c 



<210> 150 

<211> 21 

<212> DNA 

<213> Artificial sec[uence 
<220> 

<223> oligonucleotide probe 

<400> 150 

ctcctcctag agcttcccgt c 



<210> 151 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 151 

agaatgcatc gaagaactgt g 



<210> 152 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 152 

agaatgcatc taagaactgt g 



<210> 153 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 153 

aataaagaag aagccaggga g 



<210> 154 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 154 

aataaagaag cagccaggga g 



<210> 155 
<2H> 21 
<212> DNA 

<213> Artificial sequence 
<220> . 

<223> oligonucleotide probe 
<400> 155 

tatccaaaat acttaggtaa g 



<210> 156 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 156 

tatccaaaat aggtaagttc a 
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<210> 157 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 157 

aaaatactta ggtaagttca a 

<210> 158 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 158 

aaaatactta cgtaagttca a 



<210> 159 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 159 

tgtgtcaatg gtaagcactt c 



<210> 160 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 160 

tgtgtcaatg ataagcactt c 



<210> 161 

<211> 21 

<212> DNA 

<213> Artificial sequence 



34 



<220> 

<223> oligonucleotide probe 
<400> 161 

gctttcaaat aagaaagatt g 



<210> 162 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 162 

gctttcaaat gagaaagatt g 



<210> 163 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 163 

ctttgtgtca attaccctgg a 



<210> 164 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 164 

ctttgtgtca gttaccctgg a 



<210> 165 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 165 

tgcccaagat cagaagagtt g 



:210> 166 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 166 

tgcccaagat tagaagagtt g 



<:210> 167 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 167 

gcggagcagt ttgcaggggt t 



<210> 168 

<211> 21 

<212> DNA 

<213> Artificial sec[uence 
<220> 

<223> oligonucleotide probe 

<400> 168 

gcggagcagt tttgcagggg t 



<210> 169 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 169 

acatatgatt cagaaggcgt g 



<210> 170 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 170 

acatatgatt gagaaggcgt g 



<210> 171 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 171 

gattcagaag gcgtgatact g 



<210> 172 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 172 

gattcagaag tcgtgatact g 



<210> 173 

<2H> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



aatatggtac gtttgcagat t 



<210> 174 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 174 

aatatggtac atttgcagat t 



<210> 175 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 175 

gtctgtggaa gaattagaac a 
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<210> 176 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 176 

gtctgtggaa aaattagaac a 



<210> 177 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 177 

cctggacccc tttttaagcc g 



<210> 178 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 178 

cctggaccca aaagccggaa a 



<210> 179 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 179 

ttgtatttta atttgttaga t 



<210> 180 

<211> 21 

<212> DNA 

<213> Artificial sequence 



38 



<220> 

<223> oligonucleotide probe 
<400> 180 

ttgtatttta gtttgttaga t 



<210> 181 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 181 

aaccctcgtc tagatggatg t 



<210> 182 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 182 

aaccctcgtc cagatggatg t 



<210> 183 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 183 

tggatgtata cgaagctgga a 



<210> 184 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 184 

tggatgtata tgaagctgga a 



<210> 185 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 185 

gaaaaacaaa ataagcattg c 



<210> 186 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 186 

gaaaaacaaa aataagcatt g 



<210> 187 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 187 

ctgagggttg gcatgtaaat g 



<210> 188 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 188 

ctgagggttg acatgtaaat g 



<210> 189 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 189 

cttgaatatt cgtccatcca c 



<210> 190 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 190 

cttgaatatt tgtccatcca c 



<210> 191 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 191 

atatcggata caggccctaa g 



<210> 192 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> oligonucleotide probe 

<400> 192 

atatcggata taggccctaa g 



<210> 193 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 193 

caatctcatc tggaatttag a 



<210> 194 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 194 

caatctcatc gggaatttag a 
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<210> 195 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 195 

agctcactca tgtccatcag t 



<210> 196 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 196 

agctcactca cgtccatcag t 



<210> 197 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 197 

gttaggaagc gtcgtgcaaa t 



<210> 198 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 198 

gttaggaagc ttcgtgcaaa t 



<210> 199 

<211> 21 

<212> DNA 

<213> Artificial sequence 



42 



<220> 

<223> oligonucleotide probe 
<400> 199 

tgaagaaacc aaacagggta a 



<210> 200 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 200 

tgaagaaacc gaacagggta a 



<210> 201 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 201 

aatttggtac gtataataac c 



<210> 202 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 202 

aatttggtac atataataac c 



<210> 203 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 203 

tgtctcctcc tagtgcttcc c 



<210> 204 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 204 

tgtctcctcc ttagtgcttc c 21 



<210> 205 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 205 

ggttggcatg taaatgtgac c 



<210> 206 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 206 

ggttggcatg gaaatgtgac c 



<210> 207 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 207 

aaaaagacaa agaattctta a 



<210> 208 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 208 

aaaaagacag aattcttaag g 



44 



<210> 209 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 209 

agaattctta aggcatcttt t 



<210> 210 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 210 

agaattctta tggcatcttt t 



<210> 211 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 211 

gaaaatgacc cggaaacggt a 



<210> 212 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 212 

gaaaatgacc tggaaacggt a 



<210> 213 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 213 

gaaacggtaa gcatttatgg a 
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<210> 214 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 214 

gaaacggtaa acatttatgg a 



<210> 215 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 215 

cagaatctat cgatcactca g 



<210> 216 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 216 

cagaatctat tgatcactca g 



<210> 217 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 217 

ttgtctgtaa cagatttgaa t 



<210> 218 

<211> 21 

<212> DNA 

<213> Artificial sequence 



46 



<220> 

<223> oligonucleotide probe 
<400> 218 

ttgtctgtaa tagatttgaa t 



<210> 219 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 219 

agataatgta tccagtgctg a 



<210> 220 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 220 

agataatgta cccagtgctg a 

<210> 221 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 221 

actcatgtcc atcagtttgg a 



<210> 222 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 222 

actcatgtcc gtcagtttgg a 



<210> 223 



<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 223 

tttctctgct tataatacct t 



<210> 224 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<22 3> oligonucleotide probe 

<400> 224 

tttctctgct gataatacct t 



<210> 225 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 225 

ctcaggggga ccagctttgg c 



<210> 226 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 226 

ctcaggggga acagctttgg c 



<210> 227 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 227 

tccagcaact gatggaggta c 



<210> 228 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 228 

tccagcaact tgatggaggt a 



<210> 229 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 229 

ggtatttaag tttgacacca t 



<210> 230 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 230 

gtatttgaca ccatatctga g 



<210> 231 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 231 

tccacttctt ctttgccaaa c 



<210> 232 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 232 

tccacttctt tgccaaactg a 
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<210> 233 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 233 

actgaactgc cgactctatc g 



<210> 234 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 234 

actgaactgc tgactctatc g 



<210> 235 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 235 

ccgactctat cgaaaagcca a 



<210> 236 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 236 

ccgactctat tgaaaagcca a 



<210> 237 

<211> 21 

<212> DNA 

<213> Artificial sequence 



50 



<220> 

<223> oligonucleotide probe 
<400> 237 

gagttggtat atggagccaa g 



<210> 238 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 238 

gagttggtat gtggagccaa g 



<210> 239 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 239 

gaccgaaggc cgaatcaccg a 



<210> 240 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 240 

gaccgaaggc tgaatcaccg a 



<210> 241 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 241 

agacttctcc ggtcttcctt c 



<210> 242 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 242 

agacttctcc agtcttcctt c 



<210> 243 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 243 

tgcagaaggc cgagatgacc t 



<210> 244 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 244 

tgcagaaggc tgagatgacc t 



<210> 245 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 245 

acaggccttt cctggttttt a 



<210> 246 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 246 

acaggccttt acctggtttt t 



<210> 247 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 247 

aacactatta tcttcatggg c 



<210> 248 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 248 

aacactatta ccttcatggg c 



<210> 249 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 249 

agcggccatc aacaaatggg t 



<210> 250 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 250 

agcggccatc gacaaatggg t 



<210> 251 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 251 

aggcagtgaa gcagctgcaa g 
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<210> 252 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 252 

aggcagtgaa acagctgcaa g 



<210> 253 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 253 

tgaagcagct gcaagtaccg c 



<210> 254 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



tgaagcagct ccaagtaccg c 



<210> 255 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 255 

tgaagcagct gcaagtaccg c 



<210> 256 

<211> 21 

<212> DNA 

<213> Artificial sequence 



54 



<220> 

<223> oligonucleotide probe 
<400> 256 

tgaagcagct tcaagtaccg c 



<210> 257 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 257 

gattgctggc cgttcgctaa a 



<210> 258 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 258 

gattgctggc tgttcgctaa a 



<210> 259 

<211> 21 

<212> DNA- 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 259 

attgctggcc gttcgctaaa c 



<210> 260 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 260 

attgctggcc attcgctaaa c 



:210> 261 



<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 261 

gctggccgtt cgctaaaccc c 



<210> 262 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 262 

gctggccgtt tgctaaaccc c 



<210> 263 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 263 

cctgtggaca tctgcacagc c 



<210> 264 
<211> 21 
<212> DNA 

<213> . Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 264 

cctgtggaca actgcacagc c 



<210> 265 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 265 

cagaagatcc cggaggccac c 



<210> 266 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 266 

cagaagatcc tggaggccac c 



<210> 267 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 267 

caccaaccgg cgtgtctggg a 



<210> 268 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 268 

caccaaccgg tgtgtctggg a 



<210> 269 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 269 

accaaccggc gtgtctggga a 



<210> 270 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 



<400> 270 

accaaccggc atgtctggga a 



21 



<210> 271 

<2il> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 271 

taatgacacc ctccagcaac t 



<210> 272 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 272 

taatgacacc ttccagcaac t 

<210> 273 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 273 

ctgaactgcc gactctatcg a 



<210> 274 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 274 

ctgaactgcc aactctatcg a 



<210> 275 

<211> 21 

<212> DNA 

<213> Artificial sequence 



58 



<220> 

<223> oligonucleotide probe 
<400> 275 

gggtgacttt caaggccaac a 



<210> 276 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 276 

gggtgacttt aaaggccaac a 



<210> 277 

<211> 21 

<212> DNA 

<213> Artificial sec[uence 
<220> 

<223> oligonucleotide probe 

<400> 277 

gactttcaag gccaacaggc c 



<210> 278 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 278 

gactttcaag accaacaggc c 



<210> 279 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 279 

tcaaggccaa caggcctttc c 



:210> 280 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 280 

tcaaggccaa gaggcctttc c 



<210> 281 

<211> 21 

<212> DNA 

<213> Artificial sec[uence 
<220> 

<223> oligonucleotide probe 

<400> 281 

ggccaacagg cctttcctgg t 



<210> 282 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 282 

ggccaacagg actttcctgg t 



<210> 283 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 283 

gccaacaggc ctttcctggt t 



<210> 284 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 284 

gccaacaggc ttttcctggt t 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 
<400> 285 

ttcatgggca gagtagccaa c 



<210> 286 

<211> 21 

<212> DNA 

<213> Artificial sec[uence 

<220> 

<223> oligonucleotide probe 

<400> 286 

ttcatgggca cagtagccaa c 



<210> 287 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 287 

tgagagaggg aattacaggt a 



<210> 288 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 288 

tgagagaggg tgggtatgaa c 



<210> 289 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 



<400> 289 

tggccaaggt ggagaaggaa c 



<210> 290 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 290 

tggccaaggt agagaaggaa c 



<210> 291 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 291 

aggtgctgca ggagtggctg g 



<210> 292 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 292 

aggtgctgca agagtggctg g 



<210> 293 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 293 

tgggcccagc tagcactggt g 



<210> 294 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> oligonucleotide probe 
<400> 294 

tgggcccagc cagcactggt g 



<210> 295 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> oligonucleotide probe 

<400> 295 

cactctctta cggtacagaa a 



<210> 296 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 296 

cactctctta gggtacagaa a 



<210> 297 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 297 

gcctatacag tcactttttt t 



<210> 298 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 298 

gcctatacag tcacttttat g 



:210> 299 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 299 

tctgcgggag ccgatttcat c 



<210> 300 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 300 

tctgcgggag tcgatttcat c 



<210> 301 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 301 

accagtgaag aaagtgtctt t 



<210> 302 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> oligonucleotide probe 

<400> 302 

accagtgaag caagtgtctt t 
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^ A check in the amount of $ 1 60.00 to cover the filing fee is enclosed. 

^ The Commissioner is hereby authorized to charge any additional fees which may be required 
in connection with the filing of this provisional application and recording any assignment 
filed herewith, or credit over-payment, to Account No. 02-4550. A copy of this sheet is 
enclosed. 
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